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FOREWORD 


Most of my own work in the field of personality assessment 
was done between 1928 and 1985, at which time such 
rapid developments were taking place that it seemed hardly 
possible to encompass them in a book. However, I attempted 
to provide a systematic survey of verbal tests, questionnaires 
and ratings in a monograph, which was published as a Report 
of the Industrial Health Research Board in 1988. This has 
long been out of print, and Chapters VII-IX of the present 
volume cover much the same ground. I am grateful to the 
Medical Research Couneil and to the Controller of Her 
Majesty’s Stationery Office for permission to reproduce some 
of the material contained in that Report. 

Since the 1930s, personality testing has become more 
stabilized. The exigencies of applied psychology in war-time 
showed that certain methods could be put to immediate use, 
under carefully controlled conditions, whereas many bright 
ideas were quite impracticable. It showed also that so-called 
* clinical ’ methods like the interview and projection techniques, 
in spite of their apparent advantages in yielding insight into 
the personality as an organized whole, are very subjective and 
untrustworthy tools for vocational purposes. Thus the time 
is now ripe for an overall appraisal of the various approaches to 
personality assessment, for noting the more solid achievements 
and the most promising lines for further development, and for 
dismissing the unsuccessful. 

The 1940s have also seen some decline of interest in the 
assessment of individual personalities, and a greater emphasis 
on the psychology of people’s behaviour in social groups. But 
however true it may be that the individual person behaves 
differently according to the structure of the group of which 
he is a member, the problem of assessment still remains. The 
results of most investigations in general and social psychology 
are still affected by personality differences among the people 
being studied, and these need to be measured. The employer 
or teacher, and the educational or vocational psychologist, 
still wish to find out the character, social and emotional quali- 
ties, the attitudes and interests, of prospective pupils, students, 
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and employees. While this book tries to cover all such types of 
assessment, it does not deal specifically with the diagnosis of 
abnormal personalities, nor discuss clinical methods in detail. 
Some use is made of the evidence from tests of neurotic and 
psychotic patients, but chiefly in order to prove the worth 
of the tests among normal adults and children. 

Though there are numerous and valuable reviews of parts of 
the field of personality assessment, and these are listed in the 
short bibliography at the end, no book dealing with the whole 
ofit appears to have been published since Symonds’s Diagnosing 
Personality and Conduct in 1931, and none at all by a British 
author. (Special mention should be made of R. B. Cattell’s 
Description and Measurement of Personality and H. J. Eysenck’s 

imensions of Personality ; but both of these are primarily 
outlines of the writers’ own contributions.) I have tried to 


A many respects. I am 
particularly grateful also to Mark A. May and Henry A. 
Murray for their inspiration, and for the help that they gave 
with my studies while I was in America, È 

Acknowledgements are also due to the following for permission 
to quote illustrative items from published tests :— 

Dr. E. A. Doll, The Training School, Vineland, New Jersey— 
The Vineland Social Maturity Scale. Dr. D. A. Laird, Colegate 
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I 
Introduction 


E bringing up children wisely, and guiding them into suitable 
educational careers and occupations, we have to take 
account of their personality qualities. The doctor will tell us 
their physical capacities and defects ; school examinations and 
psychological tests will give us at least an approximate 
indication of their educational abilities, intelligence and 
aptitudes along special lines. But many a child with high 
intellectual qualifications at 11 years does not fulfil his promise 
in a secondary grammar school through lack of perseverance, 
weak academic interests, or emotional instability. Other 
children who are dull according to tests, or in their school work, 
develop into worthy members of society owing to their sound 
personalities. In Personnel Selection in the British Forces + 
Dr. Parry and the writer have described the success of psycho- 
logical methods of allocating recruits to jobs in the Services 
during the Second World War. But there were many individuals 
who did very much better, or less well, in some employment 
than had been predicted, largely because of the difficulties of 
making accurate personality assessments. For example, a little 
progress was made—but far too little—in diagnosing the men 
with poor morale or neurotic tendencies who were a liability 
to the army, or the potential leaders who would make good 
officers. 

Cannot the psychologist, then, devise some tests of person- 
ality, analogous to tests of intelligence and other abilities, 
which could be applied by the teacher, the vocational or 
personnel officer, or others interested in the future of a pupil 
or employee ? Such tests would be of the utmost value too to 
Child Guidance Clinic workers, and to those concerned with 
abnormal personalities such as delinquents and criminals, 
neurotics or the insane. Much unhappiness and failure, not 
only in school or employment, but also in marriage, might be 

1Cf. Bibliography. N.B.—Other references not listed in footnotes will 
be found in the Bibliography at the que of the book. 
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obviated by scientific personality testing. But though an 
enormous amount of research and experiment on tests has, in 
fact, been carried out by psychologists during the past thirty 
years or so, the answer to our question is indubitably negative. 
We shall see in this book that many personality qualities can 
be measured or diagnosed fairly effectively, but that the methods 
are far too elaborate and time-consuming, or far too dependent 
on the skill and experience of the psychologist, to be generally 
applicable for any practical purpose, or to be used by anyone 
not specially trained. True, it is possible to suggest some 
improvements on the unreliable methods that the average 
layman habitually employs. Moreover, the outlook is more 
hopeful in certain fields, such as the measurement of interests 
and attitudes. But in the writer’s opinion it is safer to be 
pessimistic regarding the future of personality testing in general, 
and one of the objects of this chapter is to explain why. 


DEFINITIONS OF PERSONALITY, CHARACTER 
AND TEMPERAMENT 


Personality has been variously defined, and its nature and 
origins variously explained, as is shown by G. W. Allport in 
his Personality—A Psychological Approach. Here we mean by 
it, simply, what sort of a person is so-and-so, what is he like ? 
Or as R. B. Cattell expresses it—personality is that which 
enables us to predict a person’s behaviour in a given situation. 
While a man’s intelligence, his bodily strength and skills are 
certainly part of his personality, yet the term refers chiefly to 
his emotional and social qualities, together with his drives, 
sentiments and interests. (Note, however, that this is much 
broader than the colloquial usage, whereby an individual is 
sometimes said to ‘ have personality ’ if he is very domineering, 
impressive or attractive.) 

Character is often used synonymously with personality, but 
is usually a more evaluative term. That is, it refers to certain 
traits of personality which are approved or disapproved, such 
as honesty, reliability, integrity, self-control and their opposites. 
The word ‘temperament’ has also received diverse definitions, 
but is most usefully limited to the constitutional and inborn 
factors underlying personality—the instinctive drives, the 
effects of the endocrine glands or other physiological factors on 
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a person’s behaviour, and certain general tendencies which may 
be at least in part hereditarily determined, such as the strength 
or urgency of drives, excitability vs. placidity, and emotional 
instability. We cannot in fact ever observe temperament 
directly, since even in early infancy it is influenced and modified 
by the parents’ or nurse’s handling and other environmental 
factors. Nevertheless quite marked individual differences do 
occur in the personalities of young babies, also among brothers 
or sisters who have apparently been brought up alike. Hence 
the existence of innate temperamental factors seems a reasonable 
hypothesis. (Like personality, temperament often has a 
narrower colloquial usage, namely, the unstable or hysterical 
traits commonly found among artists and actresses. But 
according to our view this is merely one kind of temperament. 
Indeed, very probably it is not innate at all, but is a kind of 
Personality.) 


THE STRUCTURE OF PERSONALITY 


Personality develops, then, from the interaction of the living 
human organism with an environment that frustrates or 
encourages, and conditions its impulses. Psychoanalysts have 
shown how the manner of early handling, feeding, and weaning, 
the love and security that the parents may give or withhold, 
and the sanctions ’ that society imposes, mould the growing 
child. Although their theories are largely unverified, con- 
troversial and unscientific (so that few, if any, definite associa- 
tions are established between particular methods of upbringing 
and later personality traits °), yet we may agree that an 
organized system or structure is built up which includes the 
Conscious sentiments and interests, and the unconscious 
‘mechanisms’ or complexes, and which determines the child’s 
or adult’s behaviour in any situation. Much progress is indeed 
being made through the researches of medical and of ex- 
perimental depth psychologists towards formulating general 


1 Recently Eysenck and Prell have obtained strong evidence, from a 
study of twins, that emotional stability-instability is hereditarily deter- 
mined to at least the same extent as is general intelligence. Eysenck, H. J. 
and Prell, D. B., ‘The Inheritance of Neuroticism : An Experimental 
Study’. J. Ment. Sci., 1951, 97, 441-465. 

2 Cf. Orlansky, H., Bibliography. 
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principles of personality dynamics,1 though this is not the place 
to describe such work, Special mention should be made, how- 


a man’s life-time and to become self-supporting. There is no 
justification for the implication that the motive forces of his 


structure js fairly stable, 
behaviour towards similar situations from time to time. (For 


individuals 15 years later, and found that judges could match 
or identify the one set with the other fairly successfully. In 
terms of the present writer’s matching formula, the consistency 
of personality over this time is represented by a coefficient of 

i complexity of the structure, and the 


at its most irrational (as in the neurotic or psychotic) it is 
logically determined by this structure. Yet it may appear to 
vary inexplicably. The same child may display very different 
characteristics at home, at play with his friends, in school classes 
under different teachers. And we interpret it so diversely 
that experimental investigations rarely yield correlations higher 
een assessments of a person’s traits by 
es or observers, 
esearch on leadership during and since the war has been 
particularly enlightening in showing that this quality of 
ehaviour is not, as it were, a fixed ‘ property ’ of the individual, 
but that it varies according to the kind of social group of which 


* Neilon, P., ‘Shirley’s Babies After Fifteen Years: A Personality 
Study’. J. Genet. Psychol., 1948, 73, 175-188, 
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cere 3 memben and the activities in which the group is engaged. 
GATA relevant studies, such as those of Hartshorne and May 
mer nesty, will be cited in later chapters. But this does not 
an I some American writers have supposed, that personality 
R of vast numbers of independent habits, specific to each 
Nev ion. The consistency is there if we could but trace it. 
de ertheless the traits or qualities of behaviour by which we 
si Scribe people are, it must be admitted, very rough and over- 
puted generalizations. We are far too apt to jump to 
one to ignore the complexities of structure, and to 
Meany that people will always react in certain limited ways— 

at a boy who, say, is caught cheating in class is destined for a 


life of crime, whereas the cheating may have arisen from all 
sorts of motives. Hence experimental research reveals a 
‘halo effect’. If we rate 


tendency that Thorndike ? called the 

or assess a number of people on several presumably distinctive 
traits (say good looks, intelligence, sociability, moral character), 
it is always found that the ratings overlap rather closely. 
Presumably we are influenced, unwittingly, by our general 
good or bad impressions of the people, and so attribute all the 
desirable traits to some, undesirable ones to others, almost 


regardless of their meaning. 


PERSONALITY TRAITS 


ed with attempts to measure 


Since this book is mainly concern 
t define the sense in which 


or judge personality traits, we mus 
the term is used. Trait refers to any characteristic in which 
people differ or vary from one another. But such differences 

as it were. There are physical character- 


istics, and objective features of behaviour such as speed of 
walking, amount of time spent at church or the cinema, ete. 
At the other end of the scale there are the psychoanalyst’s 
mechanisms or complexes, which cannot be directly observed 
at all, but which are inferred as the underlying * ganglia’ of 
personality structure. Personality traits lie between these 
extremes; they are more general qualities of social and 


1 Cf. Carter, L., Haythorn, W., and Howell, M., ‘A Further Investigation 
of the Criteria of Leadership’. J- Abn. Soc. Psychol., 1950, 45, 850-858. 
2 Thorndike, E. L., “A Constant Error in Psychological Ratings ’. 
J. Appl. Psychol., 1920, 4, 25-29. 
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emotional behaviour—the common features which we abstract 
from observing how people differ. And we would regard them 
(like Stagner) as descriptive rather than (like Allport) as 
explanatory. Cattell distinguishes between descriptive or 
“surface ’ traits, and underlying ‘ source’ traits. We would 
accept this, but doubt his claim to be able to determine source 
traits merely by factor analysis (cf. p. 12). 

The scientific study and measurement of traits is exceedingly 
difficult, for several reasons. First, they are mostly very vague 
and ambiguous in meaning, and different people often include 
different modes of behaviour within any one trait. Such things 
as height and weight are objective ; any two observers would 
arrive at practically the same measurements. Mechanical 
ability, arithmetical attainment and the like, also comprise 
certain types of behaviour which can be fairly readily recognized 
and agreed upon by different observers (though even here the 
notorious unreliability of examinations shows that their 
objectivity is limited). Leadership, honesty, persistence, 
introversion, aggressiveness, timidity, etc., are still more com- 
plex. Behaviour which one person interprets as aggressive 
might be called adventurous by another, or limelight exhibi- 
tionism by another. 

Our second difficulty is that they involve subjective interpre- 
tation. They are partly dependent on the observer. His own 
personality and viewpoint both influence what he notices in 
other people’s behaviour, and his interpretations of the traits 
responsible for such behaviour.! For example, a Victorian 
parent sees naughty actions in a child, and attributes these to 
rebelliousness, where a progressive parent either notices nothing, 
or regards the actions as a sign of the child’s initiative. We 
recognize this tendency in everyday life; for example, we do 
not accept A’s judgment of B at its face value if we regard A 
as a prejudiced person. And the more we penetrate behind the 
superficial behaviour characteristics to the underlying motives, 
the more subjective interpretation enters. The novelist, 
dramatist and biographer who excel in depicting personality do 
not merely give us a factual record of a man’s actions over a 
period. They select the most significant incidents, and infer 
or intuit the man’s feelings, desires, fears, ideas, and so integrate 


Cf. Vernon, P. E., ‘The Biosocial Nature of the Personality Trait’. 
Psychol. Rev., 1933, 40, 533-548, 
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the facts into a meaningful whole. Much the same is true of 
the psychoanalyst’s or clinical psychologist’s case study. 

A third point, referred to above, is the artificiality of studying 
personality in isolation from society. This runs counter to 
one of the dominant trends in contemporary psychology. 
Nevertheless the applied psychologist or the layman continually 
makes judgments about individuals as though they possessed 
distinctive traits, attitudes, and interests in almost any social 
context. The extent to which success or failure in a job, or the 
development of neurotic or criminal tendencies, is determined 
by the qualities of the individual or of the group, is a matter 
for experiment rather than for theoretical argument. 

Fourthly, we have seen that any particular piece of behaviour 
depends on such a multiplicity of factors in the personality 
structure and the environment, that no one reacts in accordance 
with a trait all the time. Nevertheless this does not mean that 
it is hopeless to try to measure personality traits or to assess 
people. For it is clear that some individuals behave more 
markedly and frequently than others do in a manner that 
most of us would call, say, timid, and that others are more 
bold or fearless. Of course we cannot measure timidity in 
absolute physical units as we can height, temperature, etc. But 
so long as we can arrange people in rank order for the trait, 
or agree that some are ‘high’, some ‘low’, the essential 
requirements of measurement are met.! The most fruitful 
approach (which originated in May and Hartshorne’s investiga- 
tions of character) has been described by the writer elsewhere 
as the trait-composite method.? 


THE TRAIT-COMPOSITE APPROACH 


Suppose we desire to obtain a measure of an individual’s 
timidity-boldness : we first select a number of situations in 
everyday life to which people react either timidly or boldly, or 
devise special situations or tests which seem likely to bring out 
the trait (cf. Chaps. IV-VI). We must apply these to a large 


1Cf. Banks, C., and Burt, C., ‘ Statistical Analysis in Educational 
Psychology ’. Current Trends in British Psychology (edit. C. A. Mace and 
P. E. Vernon). London: Methuen, 1953. 

*Vernon, P. E., ‘Human Temperament’. Eugen. Rev., 1982, 23, 
825-331. 
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group of comparable people, not only to the one individual, 
and record their responses, since the only way of measuring 
the strength or weakness of his timidity in each situation is by 
seeing how he stands relative to the average and spread of 
scores in such a group, i.e. by reference to a statistical dis- 
tribution. Here, then, we have a series of samples of the 
individuals’ timid or bold behaviour. Next we present them 
with a questionnaire or ‘ paper-and-pencil test,’ which contains, 
say, fifty questions about their past behaviour or their feelings 
under conditions which were likely to stimulate them to fear 
or bravery (cf. Chap. VIII). We get them to check the answer 
to each question which best describes themselves, and if they 
do this frankly one or more scores will be obtained which 
represent a sample of their own opinions about their timidity. 

Thirdly, we ask other people who are well acquainted with 
the individuals to estimate timidity and other related traits on 
some form of rating scale (cf. Chap. VII). Here again we shall 
get better results if they base their assessments on actual 
behaviour that they have observed in the past, rather than on 
their personal subjective impressions. And we should try to get 
hold of acquaintances who will have diverse slants or viewpoints. 
The judgments of schoolfellows may afford one quite useful 
sample, but different samples should be collected from relatives, 
teachers or employers, or others who have observed them in 
various phases of their existence. 

We now possess scores on, say, a dozen different samples of 
behaviour or opinion, and proceed to find by correlation 
methods the extent to which the samples ‘ hang together ° or 
are consistently associated. Thus if we take the schoolfellows’ 
opinions and the questionnaire scores, we shall probably find 
some agreement, but perhaps no higher than a coefficient of 0-4. 
This indicates a slight tendency for those individuals who think 
that they behave timidly to be regarded as timid by their 
friends, although there are many exceptions. Different sets of 
ratings by associates may agree more highly, but this may be 
largely because they have similar biases. That is why it is 


important to get raters from various walks of life, even if the 
inter-correlations sink. 


situations or tests are lik 
average correlation with 
(in the writer’s experience), 
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in view of the previous discussion. Some samples may show 
virtually no agreement because they are so remote from the 
underlying personality structure (e.g. this might happen if we 
took softness of voice or weak handwriting pressure as tests of 
timidity). Other samples may have been affected by some 
unanticipated motive so that they are really reflecting quite 
different traits (e.g. dislike of the experimenter, or some 
unconscious compensatory mechanism, etc.). 

The next step is now obvious, namely, to combine as many 
diverse, yet overlapping, samples as possible. This combination 
of scores is what we call a trait-composite, and this provides us 
with the best available measurement of the trait in which we 
are interested. Samples which fail to correlate positively with 
the rest are eliminated, leaving perhaps only half a dozen or so 
with a fairly high average inter-correlation. But one has to be 
very cautious at this stage, since omission of the more diverse 
samples necessarily narrows the scope of the composite and 
renders it less representative. Thus it would not do to use 
only ratings or opinions, although these would usually overlap 
quite highly. (This is a problem which statisticians have not 
yet solved even in the field of abilities ; and in the personality 
sphere, it looks as if the subjective judgment of psychologists 
and the practicability of the sample measures must largely 
determine the choice.) Provided, however, that we finish with 
half a dozen or more really varied samples, having an average 
inter-correlation of 0.30 or more, we can be satisfied that our 
composite scores have a ‘ theoretical validity ’ of at least 0.85. 

The notion of theoretical validity implies that the perfect 
criterion of our trait is a complete record of all the individual’s 
behaviour of the timid-bold variety, together with his self- 
expressed ideas and wishes, and all the impressions that he 
makes on acquaintances, or interpretations that they offer 
(whether biased or not). Naturally we can never collect all 
this information, but if we have a representative set of over- 
lapping samples, we can predict that other samples would 
overlap similarly, and that the correlation between our com- 
bined samples and the complete survey would amount to 0.85. 
Note that this approach forces us to realise that a test is not ‘a 
miraculous instrument for revealing otherwise unsuspected 
qualities’ (cf. Vernon and Parry). It is either a good or a 
poor sample of the sort of behaviour, or opinion, which goes to 
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constitute some trait. If this viewpoint were more generally 
adopted, less credence would be given to horoscopes, or 
physiognomical signs of personality, or to some of the very 
artificial tests of temperament and personality on which much 
time has been wasted in the past. 

Though no really satisfactory external criterion of the validity 
of a personality test exists, the correlation between any sample 
and such a composite provides us with the best available 
evidence. In many experiments, tests have been validated by 
correlating with associates? ratings. But while this may be a 
useful first step, it really amounts to no more than comparing 
one imperfect sample of the trait with another biased sample. 
The same objection applies to comparisons with personality 
questionnaires. 

Some psychologists would argue, with considerable justifica- 
tion, that this approach is still too vague and subjective to be 
of value, and that it would be better to do without the concept 
of traits. For example, Eysenck starts with the much more 
definite datum—the difference between neurotic patients and 
non-patients, or the difference between hysterie patients and 
dysthymic (anxiety and obsessional) patients, ete. He then 
proceeds to build up composites of tests which differentiate 
the groups as effectively as possible. Many other recognized 
“ syndromes ’ or types of abnormal personality can be and have 
been used as criteria. Again, we can contrast delinquent with 
non-delinquent children, males with females, or children who 
succeed in grammar school with others (of the same intelligence 
and social background) who fail, and likewise develop batteries 
of predictive measures. This approach too, however, does not 
Provide a sure and complete solution, for several reasons. First, 
it is doubtful how far the syndromes are distinctive; many 
different, yet overlapping, classifications of mental patients 
are possible. Neither schizophrenics, hysterics, nor juvenile 
delinquents constitute homogeneous or clear-cut groups. 

* Several psychologists have applied factor analysis in an attempt to 


settle this problem. Unfortunately no two arrive at the same classification. 
Cf. Moore, T. V., ‘The Essen’ 


tial Psychoses and their Fundamental 
Syndromes’. Stud. Psychol. Psychiat. Cath. Univ. Amer., 1933, 8, 1-128. 
Burt, C. L., * The Analysis of Temperament °, Brit. J. Med. Psychol., 1938, 
17, 158-188. Eysenck, H. J., see Bibliography. Wittenborn, J. R., 
“ Symptom Patterns in a Group of Mental Hospital Patients’. J. Consult. 
Psychol., 1951, 15, 290-302. 
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Secondly, psychiatrie diagnoses are subjective and unreliable, 
just like associates’ ratings. Thirdly, there are numerous traits 
that we would like to measure which cannot be anchored to any 
syndrome. It is reasonable to regard mental patients as 
representing the extremes of certain normal personality 
tendencies. But where are we to find extreme cases of, say, 
tolerance vs. prejudice, impulsiveness-cautiousness, persistence, 
sense of humour, and soon? Actually, both the trait-composite 
and the group-difference approaches are valuable, and should 
be regarded as complementary. A trait-composite for emotional 
stability might be based largely on measures which differentiate 
neurotics from normals and vice versa, one for persistence on 
tests which help to predict scholastic or job success. 


FACTOR ANALYSIS IN THE FIELD OF PERSONALITY 


The relations of these methods to factor analysis require 
some consideration. By statistical treatment of the correlations 
between personality tests or ratings it is possible to determine 
the common elements or factors, in the same way that g (general 
intellectual factor) is extracted from intelligence tests, v from 
verbal tests, k or S from spatial tests, and so on.! If we are 
studying two or more trait-composites or syndromes simultane- 
ously, factorization will show whether they are independent, or 
whether they would be better combined or divided up differently. 
It helps, then, in mapping out the sphere of personality more 
parsimoniously, so that we do not waste our efforts in con- 
structing composites for a lot of traits which come to much the 
same thing (e.g. boldness, aggressiveness, leadership, impulsive- 
ness, initiative, self-confidence and sociability probably overlap 
considerably, and factorization might reduce them to a small 
number of more fundamental dimensions). Also factorization 
enables us to arrive at more accurate weighted trait-composite 
scores than the simple sum of results on a set of samples. 

But statistically established factors may either be broad in 
content like our trait-composites, or they may be narrow. Thus 


1 Cf. Vernon, P. E., The Structure of Human Abilities. London : Methuen, 
1950. For surveys of factor analysis in the field of personality, see Cattell, 
R. B., Description and Measurement of Personality. London: Harrap, 
1946. Eysenck, H. J., The Structure of Personality. London: Methuen, 
1953. 
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a combination of several tests of reaction time would give a 
clear factor, but this would be a very unimportant personality 
trait. Indeed the majority of factorial studies which have been 
made in the field of personality have been conducted with 
limited kinds of samples—associates’ ratings, self-rating 
questionnaires, ete. Cattell has surveyed these studies very 
fully and has put forward a list of a dozen primary factors or 
“source traits’ from his own analyses of ratings. But his 
attempts to equate others’ results with his own involve con- 
siderable guesswork, and in later investigations where he did 
apply objective tests and questionnaires, these failed for the 
most part to group themselves convincingly under his primary 
factors.1 

There is also the difficulty that an infinite number of solutions 
exist to any factorial problem. Cattell tries to decide the most 
appropriate set of rotations by Thurstone’s criterion of Simple 
Structure, Eysenck by linking his factors to criterion groups 
(neurotics vs. normals, ete.). But it is still true to say that no 
two leading psychologists agree as to which are the best 
dimensions. Factors vary again in different groups. Most 
Investigations have been done with male college students. 
But we could not expect, nor do we usually find, the same 
patterning among women, among unselected adults, among 
neurotic or other patients, or among children of different ages. 
Nevertheless it is possible to discover a modicum of agreement 
on two major dimensions, or at least a good deal of over- 
lapping between different authors. Fig. 1 attempts to give 
a rough portrayal of such results. In accordance with 
factorial practice, traits shown at right angles are independent 
or uncorrelated, and 
other are closely correlated. More than two dimensions are 
Teally needed, and dotted lines are intended to indicate 
spokes that project in various dimensions from the plane of 


The most pervasive or far-reaching dimension might be 
termed dependability—a blend of persistence, purposiveness, 


* Cattell, R. B., ‘ Primary Personality Factors in the Realm of Objective 
Tests mal Person., 1948, 16, 459-487. Cattell, R. B., and Saunders, D. R., 
Inter-relation and Matching of Personality Factors from Behavior 


Ratings, Questionnaires, and Objecti T > Hol: 
1950, 81, 243-260. jective Test Data’. J, Soc. Psychol., 
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stability and good character. As early as 1915 Webb ! described 
such a factor—W or will-persistence, though this was based on 
ratings and was therefore certainly contaminated with halo or 
general good impression. Cattell’s Factor C is similar; it 
combines such traits as emotionally stable, realistic and 
persevering. In another careful analysis of ratings, Reyburn 
and Raath? found separate Stable-Mature-Balanced and 
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Cautious 


7 Schizo- Dysthymic 


Desurgent 


INTROVERT 
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Fig. 1.—Diagram of Relations between Main Personality Dimensions. 


Persistence factors, but these were quite closely correlated. 
Conscientious Effort was the most important of the syndromes 
arrived at in Sanford’s elaborate research with children. He 
attributed it to harmonious and integrated development of the 
Ego and Super-ego. More objective backing is forthcoming from 


1 Webb, E., ‘ Character and Intelligence’. Brit. J. Psychol. Monogr. 


Suppl., 1915, No. 3. r AT 
2 Reyburn, H. A., and Raath, M, J., ‘ Primary Factors of Personality ’. 


Brit. J. Psychol., Statist. Sec., 1950, 3, 150-158. ; 
3 Sanford, R. N., et. al., ‘Physique, Personality, and Scholarship °. 


Monogr. Soc. Res. Child Devel., 1943, 8, No. 34. 
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Hartshorne and May’s work. For though the correlations 
between their measures of honesty, persistence, self-control, 
co-operation, and consistency were low, yet there was a common 
element, which Maller 1 identified as ‘ readiness to forgo an 
immediate gain for the sake of a more remote but greater 
gain’. (Actually persistence showed least overlap with the 
other traits mentioned ; hence the ‘ good character ’ aspect of 
dependability is shown by a dotted line on our diagram.) 
Brogden ? factorized 29 character tests, and though he rotated 


them into half a dozen factors, it is clear that there must have 


been quite a strong common element or general factor. Several 
other researches have been done on persistence tests (ef. p. 83), 
and though here too some authors find very little overlapping, 
and others break down the trait into distinct group-factors or 
types of persistence, Ryans concludes that a general factor is 
justified. In a very thorough research on London secondary 
school boys, MacArthur 3 recently demonstrated a strong 
persistence factor in objective tests and ratings (together with 
subsidiary group-factors), and found that this, as well as 
intelligence, contributed to school achievement. Our notion of 
dependability corresponds, indeed, to Alexander’s 4 X-factor of 
industriousness which, as shown elsewhere,5 so largely deter- 
mines educational and vocational success. It links too with 
Charlotte Bühler’s conception of the ‘ work attitude’ among 
young children and with the ‘sense of duty’ which, she 
considers, differentiates older and more mature from younger 
adults.s Finally our factor certainly overlaps with Eysenck’s 
neuroticism factor, which was based on the contrast between 
normals and neurotic patients. For some of his best differ- 
entiating tests are usually accepted as tests of persistence. 


1 Maller, J. B., ‘ General and Specific F; i ter’. J. Soc 
Psychol., 1934, 5, 97-102. pecific Factors in Characte: 


H. E., ‘A Factor Analvsi ter Tests '- 
Psychol. Monogr., 1940, 52 Bea of Forty Charac 


the Secondary School MEPR is, University of 
London, 1951. ges @Ro0l Zeven, 5 PBT), (nesis Un 


+ Alexander, W, P., < Intelligence, Concrete and Abstract’. Brit. J. 
Psychol. Monogr. Suppl., 1935, No. 19. 
aan, op. cit., p. 11, p 
ühler, C., From Birth to Maturity. London: Kegan Paul, 1935. 
p en E., * Studies in Biographical Psychology ’, Char Person., 1936, 
, 1-84, 
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Whether Burt’s ! factor of general emotionality—obtained from 
observations and ratings of children’s emotional reactions—can 
also be identified with undependability is more doubtful; it 
is therefore shown in Fig. 1 on a different plane. Note that our 
factor is not entirely independent of general intelligence (cf. p. 
71), but we should need yet another plane to portray this. 
The underlying core of the second main dimension, orthogonal 
to or independent from the first, is more controversial. For 
though it corresponds to the familiar extravert-introvert 
dichotomy, this conception has been variously interpreted. 
Jung’s original notion of objective vs. subjective orientation 
is difficult to pin down, and we would suggest that the most 
clear-cut dimension is social-co-operative-liking people vs. un- 
sociable. Factorial studies of personality questionnaires show 
that this type of introversion overlaps largely with neurotic 
tendencies.2 Hence the angle on our Figuré between Intro- 
version and Instability is acute ; while Eysenck’s own introvert 
dimension (derived from dysthymic vs. hysterie patients) is 
drawn at 90 degrees to neuroticism. Cattell’s Factor F of 
Surgency (optimistic, sociable vs. melancholic, seclusive) is very 
close to our conception, as also are dichotomies based on manic 
vs. depressed patients, or on the Freudian oral-gratified and 
ungratified types.? Banks’s* two major factors, obtained 
from ratings of women students, are emotional stability and 
‘euthymic-dysthymic’. Reyburn and Raath’s 5 first factor was 
Spontaneity-Cheerful-Sociable, and Fiske ® found a first factor 
1 Burt himself regards it as the innate temperamental element under- 
lying (the inverse of) Webb’s acquired character factor, W. It seems to be 
more akin to Cattell’s Factor D-hypomanic vs. phlegmatic. Cf. Burt, C. L., 
“The Factorial Analysis of Emotional Traits’. Char. & Person., 1989, 7, 


288-254, 285-299. 
2Cf. Eysenck, Bibliography ; Vernon, P. E., ‘The Assessment of 
Psychological Qualities by Verbal Methods’. Industr. Hlth. Res. Board 
Rep., No. 83. London : H.M. Stat. Off., 1938. : 
3 Goldman, F., ‘ Breast-feeding and Character Formation’. J. Person., 


1948, 17, 83-108. F 
i Banks, C., ‘ Primary Personality Factors in Women: A Re-analysis ’. 


Brit. J. Psychol., Statist. Sec., 1948, 1, 204-218. 


5Op cit. s 2 
the Factorial Structures of Personality 


* Fiske, D. W., ‘ Consistency of 
Ratings from Different Sources > J. Abn. Soc. Psychol., 1949, 44, 329-844. 
In this investigation the most prominent second and third factors were 


named Emotional Maturity and Conformity. Our conception of 
Dependability partakes of both of these. 
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of Social Adaptability in ratings of clinical psychology students 
by staff, other students, and themselves, Burt, however, 
distinguishes two bipolar emotional factors—sthenic vs. asthenic 
and pleasant vs. unpleasant. Kretschmer’s 2 cyclothyme- 
schizothyme typology is even more difficult to place. It over- 
laps with Jung’s description of extravert-introvert in many 
Tespects, but is quite different in others. Moreover it is not, 
as Kretschmer claimed, continuous with the distinction between 
manic-depressive insanity and schizophrenia (cf. p. 36). 
Cattell distinguishes it sharply from surgency, but as his cyclo- 
thyme Factor A is based on the traits Easygoing, Frank, 
Co-operative vs, Obstructive, Reserved, it seems impossible to 
disregard the overlapping. We have, therefore, shown it on 


Other traits which are certainly connected with, but should 
probably be considered 7 
are dominance-submissiveness (Cattell’s Factor E), and impul- 
siveness-caution. There seems to be an obscure relation also 
with the masculine-feminine dichotomy, with the practical- 
verbal (k:m vs. v:ed) factor in abilities, and with types of 
interests,3 Obviously the situation is very confused. It might 
be cleared up by sub-dividing Dependability and Extraversion- 
Introversion into group factors, or by the addition of other 
independent dimensions. But there seems to be little prospect 
of different factorists agreeing. At least this brief survey has 
indicated that factor analysis is not yet in a position to supply 
a complete and acceptable map of personality. Though an 


be firmly established and measured b 


y practicable batteries of 
tests and ratings. Not onl 


y would it provide a foundation for 
studies, but also it would cover 2 


: ded in educational and vocational 
selection and guidance, 


1 Op cit. 
isan tetschmer, E., P hysique and Character. London: Kegan Paul, 
De 
3 Cf. North, R. D., «A 


n Analysis of the Personality Dimensions of 


Introversion-Extroversion 7: Person., 1949, 17, 352-368. 
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PERSONALITY TYPES 


Mention should also be made of the German typological 
approach. Representative doctrines are those of Jung with his 
sensation, thinking, feeling, and intuition as well as extravert- 
introvert, types; Kretschmer’s cyclothyme-schizothyme ; 
Jaensch’s T and B, integrate and disintegrate, and the syn- 
thetic-analytic types of older writers ; and Spranger’s types of 
value (cf. p. 162). Such types are quite different from trait- 
composites or syndromes, that is, ‘clusters of behaviour samples 
which are known to inter-correlate.! They are admitted to be 
idealizations or intuitive generalizations, whose value resides 
in-their bringing together a whole set of phenomena and showing 
how they interact in the hypothetical typical person. They 
help in the understanding rather than in the exact description 
or measurement of people. For example, Kretschmer ? inter- 
relates physique, psychopathological syndromes, normal 
temperamental variations, certain characteristics of perception, 
imagery and movement, and styles of artistic and literary 
productions. But no one individual is supposed to show all 
these aspects, any more than we expect to meet pure and 
complete examples of such types as the aesthete, the Aberdonian 
or the absent-minded professor. The latter are referred to as 
‘stereotypes’ by social psychologists ; they are very similar to 
the German concept, though they possess even less logical or 
empirical backing. Kretschmer, Jaensch and their collaborators 
have carried out many experiments which claim to show 
differences between groups of people representing different 
types, though they seldom apply statistical tests of significance, 
or correlation techniques, for demonstrating just how closely 
the characteristics are linked. The present writer has surveyed 
some of the literature elsewhere, and shown how chaotic are 


1They can, however, be expressed quantitatively by means of what 
Stephenson calls Q-technique. Stephenson shows also that there is no 
validity in the common objection that most of the traits or attributes of 
which a type is composed tend to be normally, not bimodally, distributed. 
Stephenson, W., ‘ Methodological Considerations of Jung’s Typology ’. 
J. Ment. Sci., 1939, 85, 185-205. Introduction to Q-Technique. Univ. 
Chicago Psychol. Dept. (mimeographed), 1951. 

2 For a sympathetic appraisal of Kretschmerian typology, see Eysenck, 
H. J., ‘Cyclothymia and Schizothymia as a Dimension of Personality. 
I. Historical Review.’ J. Person., 1950, 19, 123-152. 
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the results, due to the lack of exact specification of the persons 
and situations investigated, and of proper experimental con- 
trols.! Such work, together with the writings of Jung and 
Spranger, is full of interesting insights and hypotheses which 
would be worth following up by objective research. But it 
appears to contribute even less to the scientific assessment of 
personality than does the trait-composite, the syndrome or 
group-difference, or the factorial approaches. ? 
This brings us to the extremely important and controversial 
question of how far personality can be measured. Is it not a 
Gestalt or totality, which is disrupted by our breaking it down 
into separate traits or factors? ‘The mere enumeration of a 
person’s traits and habits does not give us the person himself, 
since it omits the essential aspect of organized structure. Each 
single characteristic has to be considered in relation to the 
whole.’ Allport has put the arguments very clearly for the 
idiographic, clinical, or intuitive approach to personality, as 
opposed to the nomothetie, analytic and psychometric approach; 
while Eysenck has forcefully supported the nomothetic 
position. Now we would agree that the clinical standpoint 
always operates in everyday life; we see the individual as 4 
unique and integrated whole, and we make our predictions about 
him, or advise him, in the light of our understanding of his past 
history and present structure. Every biographer or novelist, 
every vocational or medical psychologist who is concerned with 
personalities, does the same; and they would regard a cross- 
section of a person’s measurements on a series of separate 
traits as entirely inadequate. But unfortunately the factual 
evidence, some of which is surveyed in our next chapter, seems 
to show that every layman’s or psychologist’s insight into an 
individual’s personality is different. The “clinical” approach 
is so subjective and unreliable that more accurate predictions 
can often be made by narrower, though more objective, methods. 
The present writer would not accept Eysenck’s extreme position, 
since the objective methods themselves give very variable and far 
from certain results. But he would urge that clinical judgments 
would always be assisted, and often corrected, by making use g; 
1 Vernon, P. E., 


“The Rorschach Inkblot Test’, Brit. J. Med. Psychol- 
1933, 13, 179-200. = oe 


* Vernon, P. E., ‘ Can the “ Total Personality ” Be Studied Objectively? ; 
Char. & Person., 1935, 4, 1-10. 
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such objective techniques and scientific tests as are available. 
Moreover, he would reject the view that these two approaches 
must necessarily be opposed. On the one hand, most of the h: 
theses about personality to be investigated, and most of the ideas 
for possible tests, derive from the clinical (medical or vocational) 
psychologist, or from everyday-life observations. On the other 
hand, the psychometrist can devise appropriate objective 
techniques for validating or disproving these hypotheses. 

For example, the lay or psychological interviewer can be 
given full freedom to use his ordinary methods of enquiry, and 
to apply his insight, and yet express his conclusions in terms of 
predictions, say, of success in some job, which can then be 
validated in the same way as a test. The matching method, 
described in Chap. IV, enables correlations to be calculated 
between two sets of ‘ wholistic’ data, such as impressions of 
the voice or of handwriting, and sketches of the personalities. 
Even the uniqueness of every personality could be covered, 
theoretically at least, by the standard psychometric approach ; 
for if sufficient traits were measured, each personality would 
show a different pattern or profile of scores. Cronbach 1 has 
suggested one technique for comparing patterns of scores, as 
distinct from isolated variables, and many other statisticians 
are interested in this problem.? Especially fruitful are the 
techniques based on ‘correlations-between-persons’ or 
© between-occasions ’ (Q- and P-techniques) developed by Burt, 
Stephenson, and Cattell,® since these make possible the statis- 
tical treatment of structural properties of individual person- 
alities. Not only cross-sectional but also genetic-dynamic 
patterns can be formulated quantitatively. With the influx, 
especially in America, of laboratory-trained psychologists into 
the fields of clinical and consulting psychology, we may hope for 
more rapid advances in bringing the two approaches together. 


1 Cronbach, L. J., ‘“ Pattern Tabulation”: A Statistical Method for 
Analysis of Limited Patterns of Scores, with Particular Reference to the 
Rorschach Test”. Educ. Psychol. Measmt., 1949, 9, 149-171. 

2 Cf. Banks, C., and Burt, C., ‘ Statistical Analysis in Educational 
Psychology’. Current Trends in British Psychology (edit. C. A. Mace and 
P. E. Vernon). London: Methuen, 1953. 

3 Burt, C. L., and Watson, H., ‘ Factor Analysis of Assessments for a 
Single Person’. Brit. J. Psychol., Statist. Sec., 1951, 4, 179-192. Cattell, 
R. B., Personality. New York: McGraw-Hill, 1950. Stephenson, W., 
op. cit., p. 17. 
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The Interview: Its Reliability and Validity 
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interviewer to jump to false conclusions, and to be influenced 
by prejudices and unsound theories. A notorious instance of 
this was described by Rice.! Two interviewers tried to discover 
the reasons for the destitution of large numbers of persons 
applying for relief. One, a socialist, found that 39% of his 
cases were attributable to industrial conditions, 22% to drink. 
The other, an advocate of temperance, found 7% and 62% 
respectively under these headings. 

The selection interview is obviously unsatisfactory, too, 
because it provides such an unrepresentative and limited sample 
of the interviewee’s behaviour. He is keyed up to make a 
good impression, and subjected to a situation very different 
from that commonly occurring in his later school career or 
job. But in spite of all the adverse evidence the interview is 
likely to remain the major technique for the following reasons. 
First, it is almost universally accepted not only by employers 
and others who require assessments of personality, but also by 
people who are to be assessed. The former would resent being 
unable to exercise their own powers of ‘summing up ’, and the 
latter are suspicious of more impersonal (even if more efficient) 
techniques such as tests, examinations, or written information. 
Secondly, it is quicker and more economical than any method 
involving tests, and also more flexible or adaptable to new 
purposes or to special circumstances. The development and 
validation of a battery of tests involves protracted work by 
skilled psychologists, and if candidates of a different age or 
different educational or social background come forward, the 
whole process needs to be repeated. Tests and other methods 
more objective than the interview can certainly be of great ue 
in research on personality, but it will rarely be Pee eee ale 
them for any practical purpose other than supp!e™ g 
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id—the extent to which the method prediets, or correlates 
wia some external criterion. As shown in Chap. I, the validity 
of a test or interview can be investigated : (a) by its correlations 
with other samples of behaviour or opinion (e.g. ratings) which 
are presumed to cover the same trait or quality 3, (b) by its 
correlation with a trait-composite or factor ; (c) by its capacity 
to discriminate between groups known on other grounds to 
differ in personality, such as different types of neurotic patients, 


for predicting anything else. ests 
j different kinds of reliability and unreliability, 


Sequent chapters the type tha 
be clear from the context. 


, the personalities or traits of a 
group of individuals or ‘ subjects’. This applies to interview 


assessments, patients by psychiatrists, 


to ratings, of personality based on 
Projection tests, or expressive movements. 


2. The agreement between two 


g the responses of subjects to projection 
tests into the same categori i 
the behaviour or the responses). 


3. The agreement between scores or ratings received by the 
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times ‚applied to “time-sampling ’ (p. 94), where records of 
behaviour on alternate days over a considerable period may be 
inter-correlated. This type of reliability—better termed 
eonsisteney—chiefly applies to objective tests and question- 
naires, and to certain projection test scores (e.g. Rorschach). 
It can be studied too when a judge gives ratings on several 
items presumed to cover the same trait, as in third-person 
questionnaires (p. 109). 

5. A number of tests such as the Haggerty-Olson-Wickman 
Behavior Schedules, Kent-Rosanoff word association, Strong 
Interest Blank, ete. are scored by the resemblance of the 
subject’s responses to those of a given group (mental patients, 
people in different vocations, ete.). Such scores tend to be 
unstable unless the standardization group was very large. In 
this instance, reliability can best be established by what is 
called cross-validation. Two groups of, say, mental patients 
are used, and two scoring keys developed which differentiate 
them from normal persons. Group I is then scored on Group 
Il’s key, and vice versa. The reliability of the keys is shown by 
the extent to which each group is differentiated from normals 
by the other’s key. It is generally recognized nowadays that 
the same kind of procedure should be applied in all validatory 
studies, particularly of batteries of tests. Validities established 
within a single group only are often very unreliable. 


RELIABILITY OF THE INTERVIEW 


Obviously no method of personality assessment can be valid 
unless it is first reasonably reliable in any or all of the above 
senses. No exact standards can be laid down, but lower 
coefficients of reliability are normally accepted in the field of 
personality than in that of abilities. Only Type 2 correlations 
are expected to approximate to 1:0. For the other types, °70 
upwards would be considered satisfactory, and -50 to :70 as low 
but passable. Coefficients of -85 and over are suspiciously 
high, unless obtained from exceptionally thorough tests or from 
the combined judgments of several persons. They may indicate 
that the method is assessing some rather artificial or stereotyped 
quality which possesses poor validity. 

Hollingworth carried out one of the first studies of the 
reliability (Type 1) of interviewing. Twelve sales managers 
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interviewed 57 applicants independently and ranked them on: 
“ suitability for the Position in question °, 
applicant received rankings most of the way from the top to the 


correlations, Hartog and Rhodes’s 1 stu 


16 candidates by two boards of experienced Civil Service 
examiners deserves particular notice, since it was typical of 
much employment or scholastic interviewing. Each board had 
the candidates? educational records, and questioned each 
candidate for quarter to halfan hour. The four or five members 
recorded separate assessments of aler 


Suitability of personality for the Ci 
discussion reach, 


there was fairly close agreement, but the two boards differed 


marks being only 
Pancy in marks was 12 %, but it 
ranged from 1% to 31% for different candidates. 


Other studies by Webb, Magson, Fearing, Cantril, Hill and 
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This reliability is low, but it 


2 und for ratings of personal 
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1 Hartog, P. and Rhodes, E, C An Examinati inati 
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London : Macmillan, 1935. x $ a i 


2 ewman, S. H., Bobbitt, J. M., and C 
of the Interview Method in an Officer C: 
Amer. Psychologist, 1946, 1, 103-1 9. 
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Quite commonly there is disagreement on 50% of cases.* 
Large variations are also found in the proportions of cases 
(from comparable populations) which different psychiatrists 
put into the various psychopathological groups. 


VALIDITY OF THE INTERVIEW 


The evidence regarding validity is still less favourable, 
though Vernon and Parry quote studies where particular 
psychologists or personnel selection officers (PSOs) were notably 
successful in picking recruits for Service employments or for 
officer training. Usually the PSO in the Army or Navy had a 
record of the candidate’s success on a number of intelligence, 
educational, or aptitude tests, and his answers to a biographical 
questionnaire. After interviewing him for about quarter of an 
hour, he summarized his judgments of suitability in a final 
recommendation and, in some, experiments, assessed his 
likelihood of success in that employment. The remarkable 
result of several such investigations was that the PSOs’ pre- 
dictions, although combining the indications from tests with 
judgments of experience, personality and interests, were on the 
average less valid than predictions based on the best tests 
alone. Since there were big individual differences in the 
selecting ability of different PSOs, it follows that several of 
them must have given much worse predictions than the tests. 
Stuit ? quotes similar results from the U.S. Navy where inter- 
viewers did no better, or even worse, than ability tests. (No 
personality tests were used in these studies.) 

Similarly McClelland ® followed up pupils in Dundee secondary 
schools and showed that, if account had been taken of primary 
school teachers’ judgments of industriousness or other person- 
ality qualities, the number of instances of bad selection would 
not have been reduced. No investigation seems to have been 
made of grammar school headmasters’ interviews, but they 
would hardly be likely to do better than the teachers who had 
P 1 Cf. Ash, P., ‘ The Reliability of Psychiatrie Diagnoses ’. J. Abn. Soc. 

Sychol., 1949, 44, 272-276. arch and Test Development in the 


2 Stuit, D. B. (edit.), Personnel Rese : t ın I 
ureau of en Personnel. Princeton, N.J.: Princeton University 


Press, 1947. ; 3 
3 McClelland, W., Selection for Secondary Education, London : University 


of London Press, 1942. 
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inst degree marks 1 and 2 years 
trance examination gave small 


of success, and a battery of aptitude tests 
predictions. 


always be accepted at their face 
value, for two technical reasons, First, the accepted candidates 
ve been actually selected chiefly on 
the basis of the Interview. In sucha selected group the validity 
idi ests, ete., which were 
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ae Personality tests yet made is that of Kelly and Fiske into 
the selection of Candidates for training as clinical psychologists. 
Was studied for 7 days. So far results have 
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Table I are averages of correlations with the 2-year and 4-year 
assessments. They are all very small and statistically unreliable, 
but this is to be expected in highly selected groups. The 
following conclusions are indicated : ` 


TABLE I 


AVERAGE VALIDITY COEFFICIENTS OF PROCEDURES USED 
IN SELECTING CLINICAL PSYCHOLOGISTS 


Correlations 

2yr. 4yr. 

Paper qualifications only, judged by 2 psychologists . AI -22 
+15 125 


Paper qualifications + 1 hour’s interview 
Judgments based on paper qualifications + 
objective tests 3 > 5 > 5 27 +28 
Separate objective tests only : Miller Analogies. al -80 
Certain scores on Guilford-Martin Personality Inventory '22 -16 
Strong Interest Blank, scored for psychologist - . 125 +20 
Strong Interest Blank, scored for clinical psychologist . -82 “16 
Judgments based on above data + a series of projection tests -29 +26 
Separate projection tests, each given by an independent 


“a number of 


tester 
Rorschach Inkblots . . 0 . ° . 12 05 
Thematic Apperception - : R a L 12 
__ Sentence Completion (group test) . 4 : . 19 ‚21 
Ditto + a further intensive interview - 5 a . 27 +26 
Judgments of a team based on all the above - . . 24 -80 
Ditto after observing ‘ Situations ? tests (final prediction) - ZU u 


Situations alone judged by 8 independent observers 


_ 1. The addition of the first or second interview. does not 
increase the validity of predictions. (Note that these judgments 
were not used for the actual selection of candidates.) 

‚2. The study of paper qualifications and objective test scores 
gives almost as good predictions as any of the methods involving 
personal contact with the candidates. The collection of more 
data, or the bringing of additional judges into the decision, 
seldom improves, and may lower, the validity. f 

3. Certain objective tests are superior to any method in- 
volving ‘ clinical’ judgment, though we cannot say ‚whether 
these would stand up as well on subsequent occasions. | A 
battery of the best objective ability and personality tests might 


well achieve a validity of 4. The Situations tests also give 


promising results (cf. group-observation methods, p. 96). 
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4. The projection tests are decidedly less valuable than the 
better objective ones, 


technique is better than 
5. The degree of confi 
provided completely mi 
Kelly and Fiske’s e 
more data the judge o 
likely to Over-weight 


, are of less value for practical predictive 
purposes than quite superficial but more objective methods. 


ical conditions, nor the ability of 
derately good predictions of the onset 
shows that psychiatric sereening 
an Services was of some value ; though 
mental evidence, we cannot state that 
methods might not have been more 
Successful. Again, in Chap. X, w 
tests such as Rorschach and T.A. 
differential diagnosis of mental 
which depend lar, 
appear to be of ver i cational predictions. 
At the same ti ork must be accepted 


writer elsewhere,? point 
nt conclusions. “These candidates too under- 

went a cumulative procedure, though it was less elaborate 
ays), and no objective personality tests 
were used. The criterion against which validity was measured 


* Cf. Wittson €. L., Hunt, W. A., and § 
» C. L., H » W. A., tevenson, I., * A Follow-up 
er of Neuropsychiatric Screening’, J, Abn. Soc. Psychol., 1946, 41, 
* Vernon, P, E., ‘Th idati Ji 
EN e Validation of Ci 


vil Service Selection Board 
Occup. Psychol., 1950, 24, 75-95, 
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refer to 330 selected candidates. They are higher than Kelly 
and Fiske’s, mainly because they have been corrected for 
selectivity. The following points emerge : 

1. Objective tests and examinations have poor predictive 
value. Even the best combination of them would hardly give 
a coefficient greater than -3. Personality tests might have 
helped. Thus the sociometrie rating is promising. 


TABLE II 


VALIDITY COEFFICIENTS OF VARIOUS PARTS OF 
CIVIL SERVICE SELECTION PROCEDURE 


Entrance examinations, or verbal intelligence tests (average 


coefficient) . 1 A : . . 5 be 3 
Observation of discussion among groups of candidates (average for 3 
„„„„ observers) f ; p 5 k : . 82 
Ditto -+ observation of committee and other exercises (2 observers) 44 
Consideration of all above evidence + individual interviews J 
(2 interviewers) 0 à = 


Fi Sociometric ratings by candidates themselves : . 
qa mark after discussion between 3 observer-interviewers » . 
eparate board considers all above evidence and re-interviews +58 


2. With the accumulation of evidence from successive 


exercises (Situations), the validities rise. y 
_ 8. The addition of interviews produces no appreciable 
improvement ; (but the judgments based on exercises may to 
Some extent have been contaminated by interview data, as the 
interviews did not always come last). je 
4. A re-interview by an independent board (containing no 
Psychologists) does improve the final predictions significantly. 
Another relevant research into the assessment of children’s 
Personalities is briefly described by Burt. Using teachers 
ratings as criteria, he found that objective tests and judgments 
ased on projection material gave the poorest validities. 
Individual interviews combined with observation of behaviour 
were more successful, and observations in standard ‚social 
Situations (cf. p. 96) better still. But the combination of 
methods, after discussion among 3 or more observers and 
interviewers, gave the highest validity. 
_There is good evidence also for the value of one type of inter- 
Viewing in the follow-up studies of cases given vocational 
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guidance by the National Institute of Industrial Psychology 
(cf. Vernon and Parry). A few tests of abilities are employed, 
but the recommendations are based mainly on the psychologist’s 
synthesis of information from the school and the parents, and 
from an interview with the candidate. Over 90% of those 
who follow the recommendation are found to be satisfied and 
me years later, as compared with 
against the recommendation. It is 
possible that fuller tests of ability together with some personality 
n this, but it is difficult to see how 
applied when an unlimited range of 
Occupations is under consideration. Here (as also in Civil 
Service selection and in Burt’s investigation), the interviewer 
follows a systematic procedure designed to bring out the most 
relevant traits and interests, but makes little use of psycho- 
i or of projective testing. This may have 

he, apparently, more valid results. 


CONCLUSIONS 


and training them, Clearly it is desirable to develop more 
objective methods, despite the di i i 


Possible, they might well i i 

urrent 
procedures, It would be be if the en se 
confined to assessing certain traits which cannot readily be 
covered by other methods, that is—treatı y 
results are combined with t 
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is used, for example, in the selection of officers for the American 
Army, sometimes also in the British Civil Service and in second- 
ary school selection. Jenkins describes the American officer 
boards, where the interviewers do not explore experience or 
background but look for and assess only those aspects of manner 
and speech, and social traits which are readily brought out in 
an interview situation. It is worth remembering that the 
object of selection is usually to pick people who will impress 
employers, colleagues, and subordinates favourably—not 
merely individuals who will do the job well. Often, therefore, 
the interview may constitute a useful analogous exercise 
(cf. p. 99). à 
We can now turn to a discuss 
methods, or at least a selection of them. There has been so 
much ingenuity in the construction of tests, and so much 
experimentation often leading to variable or contradictory 
results, that we shall consider only those whose value, or lack of 
value, appears to be best authenticated. Elsewhere the writer 
7 Be pointed out that psychological tests derive initially from 
‘Ae methods we normally employ in judging people in everyday 
ife, or in interviewing them, though they are refined in a 
number of ways. ‘A psychological test, by presenting & 
Standardized task or situation, elicits a sample of the testee’s 
Chaviour which can be objectively scored and compared with 
norms of performance, and which has been proved to be 
Predictive of future occupational or other behaviour.’ In the 
field of personality this is an ideal rather than an actuality. 
ew tests have reached a stage where application and scoring 
are as standardized as in testing, say, intelligence ; and few 
Ave trustworthy norms, except for specialized groups such as 
Merican university students. Reliabilities are low, as already 
Pointed out, and validities very difficult to establish. It is 
ants Perhaps, to ask not what tests are available, but a 
methods show most and least promise in the scientific study © 
Personality, 


ion of the more objective 
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Physical Signs of Personality 


PHYSIOGNOMY 


blue eyes—innocence; th i k 
weakness, the forchead intellect, and so on. The ancient Greeks 
attributed to people the ch 


resembled. For example. 


epithets for Personality— 


stic and Judgments of pers 


onality given by 
For example, 


Paterson and Ludgate 1 asked 
1 Paterson, D, G and Lud. $ ie 
rson, i} gate, K. E. Blonde and Brunette Traits : 
A Quantitative Study’, J, Personnel Res., 1922, 1, 122-128, 
32 


Physical Signs of Personality 33 
94 students to assess people well known to them on 26 traits, 
and, on dividing these people into blondes and brunettes, 
found no appreciable difference in any trait. There was no 
support for the common belief that blondes are more domin- 
eering, dynamic, or impatient. The claims of German racial 
theorists regarding the superiority of Nordic physical types to 
Mediterraneans are equally baseless. These so-called races 
are distinguished not only by colouring of hair and eyes, but 
also by shape of skull; Nordics being dolichocephalie or long- 
headed, Mediterraneans brachycephalic or broad-headed. But 
experiment has shown no relation between length or breadth 
of skull and either intelligence or other psychological qualities. 
Another influential physiognomical theory was that of 
Lombroso, the Italian criminologist, who claimed that criminals, 
belong to a degenerate physical type, which can be recognized 
by characteristic features or stigmata. But he omitted to 
enquire how often these same stigmata occur in non-criminals. 
oe research certainly shows poor physique and physical 
pa to be somewhat more common among criminals and 
elinquents, but there is no distinct criminal type. _ 
Palmistry and Chirognomy. Psychologists often classify these’ 
along with astrology, phrenology and physiognomy as mere 
Charlatanry. Recently, however, C. Wolff! has provided 
Plausible reasons why certain dimensions of the hand and 
ingers, and lines on the palm, might be affected by endocrino- 
gical and neurological conditions, and thus indirectly reflect 
ealth, temperament, and intelligence. She describes a well- 
Controlled experiment in reading from the hands alone ; her 


Indgments of the personality traits of 24 students ar 
eir self-judgment: obvious] unsatisfactory cr! erion) to a 
Teran í So far her methods 


small but statistically significant extent. 
have not been en others, and it is doubtful whether 
Most professional palmists work on equally sound principles. 
Readers may object that they themselves or their acquaint: 
ances have had remarkably accurate character-readings from 


Palmists, phrenologists, and the like. Quite apart from fraud, 
é First, there are a 


ere are good reasons why this is possible. 
Number of vague characteristics which anyone will accept as 
applying to themselves, e.g. ‘ strong sense of humour, abilities 


Vout, C., ‘ Character and Mentality as Related to Hand-Markings Hy 
it. J. Med. Psychol., 1941, 18, 364-382. 
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i ”, ete. 

fully a preciated by others »€ 
of 5 SR ieo e diagnosis to 
on the average each of them considere: 


feminine, 
show some of thi 


or weight, also size of head, and intelligence 
or academic ability on the one hand, and le 
traits on the other, 


S some 2% larger in volume 
. Industrial executives, 
gious leaders have bee 


periment was that of Murdock and Sullivan,‘ 


Be ees Tsonality. New York : 
en ang a teldon (cf. p, 37) has also described a feminine 
Physical type in m i ality correlates 
1 f ies of Genius, Vol. I, Stanford, 
versity Press, 1995, 
ock, K., and ullivan, L, R., ‘A Contribution to the Study of 
Mental and Physical Measurements in Normal Children’, Amer. Phys. 
Educ. Rev., 1923, 28. 
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Lia Konad correlations of +14 to 16 between the heights and 
Th Ar of 600 children and their intelligence test scores. 
a Saal not be concluded that physical size means a 
a rain and that a large brain indieates intelligence. It is 
east as likely that the more intelligent and successful 


` individuals have usually been brought up in more favourable 


er and so tend to be superior in health and physique. 
Tara in the evolution of animal species, inerease in size of 
Bedi, or cerebrum goes with increase in intelligence, and 
Dimi the white human race has larger brains than certain 
angen ive peoples such as Australian aboriginals, there is no. 
char le correlation among whites. It has been suggested that 
ae exity of convolutions or other features of the brain, 
physiol than mere size, underlie intelligence. But in fact 
le ogists cannot, at present, tell us anything about the 
from ectual or other personality characteristics of an individual 
ene his brain, except in cases of disease or gross 
for eh We need hardly add that there is no justification 
an y of the claims of the phrenologists. Not only is it untrue 
parti mental capacities and personality traits depend on 
ar sections of the cerebrum, but also the strength of a 
ty would not affect the size of that section, nor produce 


any swelling visible on the skull surface. 


TYPES OF BODY BUILD 


A third and more important correlation is that between 


hody build and certain traits which may be roughly labelled 

e pa Oa xira verion: The notion of physical and tempera- 

in hi al types has a lengthy history, well described by Roback * 
is Psychology of Character. Shakespeare summed it up: 


Let me have men about me that are fat ; 

Sleek-hended men and such as sleep 0° nights : 

Yond Cassius has a lean and hungry look ; 

He thinks too much : such men are dangerous. 

n is that of Kretschmer,? 


The best-known modern formulatio 
phrenic patients were of 


who found that a majority of schizo 


1 Roback, A. A., The Psychology of Character. 


Brace, 19 
, 1927. 
? Kretschmer, E., Physique and Character. London : Kegan Paul, 1925. 


New York, Harcourt, 
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asthenic or leptosome (tall-thin-pale) and athletic (intermediate) 
build, whereas a majority of cycloid or manie depressie 
patients were of pyknic (rotund-florid) Physique. He extende 


genial, and sociable, He also cl: e 
body build, Stating that they show distinctive philosophies or 
interests. For example, realists, humorists, and materialistic 
Scientists tend to be pyknic, while romantics, idealist philoso- 

phers and metaphysicians tend to be asthenics, 3 
Ow we cannot accept Kretschmer’s theories without 
qualifications. Later Surveys, even of psychotic patients, do 
i i Particularly when age is 
controlled. (F i ive insanity tends to occur later 


me-cyclothyme dimension. He did find, 
< Neurotic conditions of hysteria and dysthymia 
(anxiety -obsessi inuous with the normal, and 


ily build, Perhaps the most striking 

ificati > who prefers Viola’s 
classification of build into macrosplanchnie (predominance of 
i microsplanchnic (predominance of 
l over horizontal di Kretschmer’s. He 


pe 4 Tetschmer himself has ac 
criteria for classification 


The Scients : s f 
ledge, 1952, A pertot Scientific a pal Y Personality. London: Rout 


Pi 
Totation to a criterion ’, since it 
he Anal, 5 Titerion, i 
a. L. e An ? 5 j 
1938, 17, 158-185. P yels of Temperament *. Brit, J, Med. Psychol., 
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subjective. Moreover, his theory implies that every individual 
falls definitely into one of four distinct types, rather than that 
build is a matter of degree ; (the fourth type is the dysplastic or 
irregular). Other writers have suggested various more objective 
and quantitative morphological indices. Eysenck compared a 
large number of bodily measurements by factor analysis, and 
found the following simple index to be representative : 
Height x 100 

6 x Chest Diameter’ 
of 100, while asthenics range up to about 180, pyknics down to 
about 70. 

In recent years an ingenious threefold classification of build 
and temperament has been put forward by Sheldon.2 It is 
based on the relative development of three bodily components : 
endomorphy (roundness, softness), mesomorphy (hardness, 
muscularity) and ectomorphy (delicate,.‘ linear ’ physique with 
weak development of both visceral and somatic structures). An 
individual is photographed from standard positions. From 
measurements of the pictures he is assigned a threefold rating 
on a 1 to 7 scale, indicating his standing on each component. 
For example a definite pyknic, on Kretschmer’s system, might 
be 711, and a moderate asthenie 246. Sheldon’s tempera- 
mental classification, which is claimed to correspond closely to 
the physical, is based on thorough clinical interviewing and 
observation. By means of ratings on 60 traits the relative 
prominence of viscerotonia (sociable, affectionate, love of 
comfort), somatotonia (vigorous, assertive, love of muscular 
activity), and cerebrotonia (reserved, love of privacy and mental 
activity), is determined. Sheldon’s original correlations are 
obviously spurious, and later work suggests that the connection 
between his physical types and personality traits amounts to the 
usual figure of around 2 to -3.2 Investigations have been made 


The average male adult obtains an index 


1Sheldon, W. H., and Stevens, S. S., The Varieties of Temperament. 
New York: Harper, 1942. See also, Hunt, J. McV., Bibliography. 

2 Cf. Child, I. L., and Sheldon, W. H., ‘ The Correlation Between Com- 
ponents of Physique and Scores on Certain Psychological Tests y Char. 
& Person., 1941, 10, 23-34. Fiske, D. W., ‘A Study of Relationships to 
Somatotype’. J. Appl. Psychol., 1944, 28, 504-519. Smith, H. C., 
* Psychometric Checks on Hypotheses Derived from Sheldon’s Work on 
Physique and Temperament’. J. Person., 1949, 17, 310-320. Child, I. L., 
“The Relation of Somatotype to Self-ratings on Sheldon’s Temperamental 
Traits’. J. Person., 1950, 18, 440-453. 
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into the somatotypes of officer candidates and air-force pilots, 
but there seems to be no evidence that they are of any value te 
selecting men of suitable temperament. Like Kretschmer’s 
types, somatotypes probably depend considerably on age. 


PSYCHOSOMATIC AND ENDOCRINOLOGICAL 
RELATIONSHIPS 


Although these small positive corr 


elations are almost the 
only examples of direct relations betw i 


ess. Demosthenes’s 


y small stature not infrequently display 
But there is, of course, 


ase elicited in medical 
Another physical fact to which 


f + F., Emotions nd i] k: 
Columbia University Press, 1938, les Boa) Changes. New Yor 
aterson, H, F., ¢ Organic Inferiori and thi Eee itude’. 
Ae Psychol, : 1981, 2, 87-101. ity e Inferiority Attitu 
ars H. E- ‘Order of Bi h in Relation to the D, 1 f the 
Child’. 4 Han dbook of Child m or ) e Development o; 
Mass. : Clark University Brea, os mene Be son Ne 
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however, seem to be a slight tendency for only children to be more 
susceptible to maladjustment and delinquency than others. 

There is some connection, again, between lefthandedness (and 
possibly left-eyedness) and personality. Certainly an undue 
proportion of delinquent and of educationally backward 
children are left-handed. But we cannot say whether left- 
handedness is a symptom of nervous and temperamental 
instability, or whether some young children with maladjusted 
personalities tend unconsciously to express their revolt against 
society by contrariness over handedness. Burt? gives an excellent 
discussion of the complex nature and origins of handedness. 
Certain sensory deficiencies are often the product of neurotic, 
rather than of purely physiological, conditions. Thus Eysenck 
finds defective dark adaptation or night vision useful as a measure 
of neuroticism. Similarly Slater ? obtained poor visual acuity 
more frequently among neurotic patients than among normals. 

Peptic ulcer is the most notorious example of a psychosomatic 
disorder. A large proportion of ulcer patients (though by no 
means all) tend to show a characteristically drawn and anxious 
facial expression, and to be asthenic in physique. They are 
often highly vigorous and ambitious people who drive themselves 
to the detriment of their health and digestion ; and the develop- 
ment of the ulcer often follows some serious disappointment or 
frustration in their lives ® But it is hardly possible to say how 
far physical factors underlie the psychological, or vice versa. 
Certain forms of asthma and many other diseases have similarly 
been shown to be associated with, or to be unconscious expres- 
sions of, psychological mechanisms. Yet there is no one-to-one 
connection which would justify the diagnosis of people who are 
liable to particular illnesses as always belonging to particular 
personality types. 

The notion that bodily chemistry underlies temperament goes 
back to the Greeks and Romans, to Hippocrates and Galen.* 
The four classical types of temperament—sanguine, cholerie, 


1 Burt, C. L., The Backward Child. London: University of London Press, 


19837. 
2 Slater, E., and Slater, P., ‘ A Heuristic Theory of Neurosis’. J. Neurol. 


Neurosurg. & Psychiat., 1944, 7, 49-55. 
3 Cf. Davies, D. T., and Wilson, A. T. M., ‘ Observations on the Life- 


History of Chronic Peptic Ulcer’. Lancet, 1937, 1353-1360. 
4 Cf. Smith, M., ‘ The Nervous Temperament ’. Brit. J. Med, Psychol., 
1930, 10, 99-174. 


40 Personality Tests and Assessments 


prominence of four bodily fluids 
bile, black bile, and phl 


ir secretions on growth, health, and 
1 for example, not only describes 


ity and insanity largely to glandular a 
orders. Now it is certainly false to regard each hormone 


ay in some cases bring 

-) Itis far too simple, for example, 

to regard the sex glands as determining masculine and feminine 

» or their efflorescence during 

puberty as Producing the Well-known instability of adolescence. 
For Psychologists Tecognize th 


therefore, meb 
od tests, or X-rays 4 
nything more than very 2: 
Almost the only well- 

the endocrine system, 

of the Pituitary, are found in a considerable 

Proportion, Perhaps 20% of children showing serious delin- 
quency or maladjustment 2 ut it is not clear whether such 


diagnosed easily and objectively, nor how 
frequently they occur in Psychologically normal children. Some 
Vestigators claim a correlation between alkalinity of body 


this is denied by others. 
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rin Ui y Press, 1936. Lurie, L. A., 
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Psychiat., 1998 = 158, 07 Disorders of Child, 

a 
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One would expect that persons with high basal metabolism would 
in general be more active and energetic than those who live ata 
lower rate. An investigation by Dispensa was unpromising, but 
both Sanford and Herrington provide confirmatory evidence.! 
Another plausible theory which may help to link biochemical 
factors with bodily build and temperament is that some people 
are dominated by the sympathetic nervous system, others by 
the parasympathetic. ‘ Sympathicotonics” are supposed to be 
more dominating, impulsive, active, while ‘ vagotonics’ are 
more anxious, depressed, and cautious. A large amount of 
work has been done with such indices of autonomic activity as 
basal metabolism, pulse, respiration, blood pressure, salivation, 
tendency to flushing, and sweating, etc. Unfortunately these 
variables fluctuate widely from day to day, and are mostly 
very specific, thus it is difficult to get reliable diagnostic 
measures on reasonable numbers of cases. And though factor 
analysis has been applied, there is no agreement as to what are 
the main physiological ‘ dimensions ’.* Nevertheless the 
evidence suggests a general physiological activity factor, and 
an autonomic imbalance or sympathetic vs. parasympathetic 
factor. Sanford finds that parasympathetic response does link 
with asthenic build and (negatively) with social, outgoing, 
lively personality traits. Similarly Eysenck, using Wenger’s 
best index—the salivation rate—found it to be higher among 
hysterie than dysthymic patients. He admits that this might 
be due to greater anxiety at being tested among dysthymics, 
which would inhibit salivation. Darling ® considers that 
measures of psychogalvanic response (cf. below) and of blood 
pressure in children (which tend to be inversely | related) 
correspond to parasympathetic and sympathetic activity. The 
‘Relationship of the Thyroid with Intelligence and 
Personality’. J. Psychol., 1938, 6, 181-186. Sanford, R. N., et. al., 
* Physique, Personality and Scholarship ’. Monogr. Soc. Res. Child Devel., 
1943, 8, No. 34. Herrington, L. P., ‘ The Relation of Physiological and 
Social Indices of Activity Level’. Studies in Personality (edit. Q. McNemar 
and M. A. Merrill). New York: McGraw-Hill, 1942. — K 
2Cf. Darling, R. P., ‘ Autonomic Action in Relation to Personality 
Traits of Children’. J. Abn. Soc. Psychol., 1940, 35, 246-260. Wenger, 
M. A., ‘ Studies of Autonomic Balance in Army Air Forces Personnel ’. 
Compar. Psychol. Monogr., 1948, 19, No. 4. Cattell, R. B., Personality. 
nee York «McGraw-Hill, 1950. Sanford, R. N., op. cit. Herrington, 
~ P., op. it. 3 
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t -8 with ratings on 
Activity, Alertness, Co-operativeness, and Attention. 


Two other theories should be mentioned. McDougall ! 
m inhibition of the lower 
by the higher nervous centres, and that this is released to 
ences; in other words, the 
te akin to mild intoxication. 
tests (cf. p. 79). Finally, there 
B (Basedowoid) types, which are 
athyroid and thyroid glands, and 
Originally derived from differences 


and inflexible, During the Nazi régi 


entangled with so-called racial psycholo ‚ and hardly merits 
further consideration. Ry ee! 


Person is stimulated say, by an electric shock, 
En ae a ofa shoe » or by emotionally toned words in a 
10n experiment (ef, p. 172). Li d 

Pressure and resphar (cf. p ). Like the pulse, bloo! 


- J. Abn. Soc. Psychol., 1930, 24, 293-309. 
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particular mental state. The PGR is the end result of a variety 
of factors—circulatory, postural, sweat secretion and tempera- 
ture, not to speak of the thickness of the cuticle, the position 
of the electrodes, and other vagaries of the apparatus ; and it 
is elicited to varying extents in different people by a great 
variety of stimuli. Thus, although twenty or more experiments 
on the psychological correlates have been published, their 


Hand N, 


7, 100 ohms 


Battery 


Variable N x! 1000 ohms 
Resistance‘, / seb 


NER 


N 


Fig. 2.—Psychogalvanic Reflex. Electrodes are attached 
to the back and palm of the subject's hand, or to two 
fingers, and are connected to a battery by a Wheatstone 
Bridge circuit. ‘The variable resistance is then adjusted 
until no deflection appears in the galvanometer ; the 
subject’s resistance (multiplied by 10) can be read off. 
When a stimulus is applied the resistance tends to drop, 
and the galvanometer needle moves. 


results are largely contradictory. Perhaps the most thorough 
work is that of Darrow and Heath.! Neither initial, nor 
average resistance, nor total drop in resistance during the 
application of a series of standard stimuli, show any clear 
relation to other tests or ratings of emotional traits, though 
rapid recovery rate from stimulation appears more promising. 
Since there is a definite tendency for more emotional word 
associations to evoke bigger responses than less emotional,? 


1 Darrow, C. W., and Heath, L. L., ‘ Reaction Tendencies Relating to 
ersonality’. Studies in the Dynamics of Behavior (edit. K. S. Lashley). 


hicago, Til. : Chicago University Press, 1932. r, 
Cf. Smith, W. W., The Measurement of Emotion. London : Kegan 


Paul, 1922. 
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bear this out, but others. į 


We would conclude that the PGR may be 
tudies of the associations or other stimuli 

n individual, but that it can 
hardly be employed as a generalized test of any identifiable 
Personality trait, 


The electroeneephalograph, 


which records the overall 
electrical activity of the b 


rain in the living subject, has also 


age during early childhood, but is apparently 
unrelated to inte 


distinguish visual thinkers from verbalizers among adults by 
means of their EEG recor 


med. One very useful finding i 


., a conclusion : although this chapter is so largely negative, 
it would be as incorrect to Say that physical characteristics are 
ality as it would be to clai 


ill, or that slen 


: D at we have shown is that, with 
rare exceptions, no isolated Physical or che 
i E 


: ence any system of personality 
an Such signs is certain te be false. None the 


tors may exert an 
development of the individual 
* Walter, w, 
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personality. Some of these—such as body build, and certain 
chemical, psychogalvanic and EEG measures—might occasion- 
ally be included in appropriate trait composites, although 
they do not constitute samples of behaviour in the sense 
described in Chap. I. We shall see, also, in the next chapter 
that the features, hands and body may, when subjectively 
interpreted, be expressive of personality. 


IV 


Expressive Movements 


JUDGMENTS FROM PHOTOGRAPHS 


E judging personality from external appearance we normally 
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irritable or the good-humoured individual might or might not 
habitually show the expressions typical of anger or happiness, 
respectively. 

The ranking of photographs for intelligence gives very low 
average correlations of -+-10 or less with intelligence test scores, 
though some raters achieve fairly high correlations and others 
negative coefficients. There is no tendency for psychologists, 
teachers, or doctors to do any better at this than other judges, 
and women are no more successful than men. It is found, 
however, that most judges agree more closely with one another 
than with the true order; in other words, that there is a fairly 
widespread convention as to what the intelligent or dull person 
looks like—a convention which is largely fallacious. Pintner* 
questioned his judges and found that they relied on a diversity 
of signs such as bright eyes, wearing spectacles, also on resem- 
blances to acquaintances. Judgments of personality traits 
such as sociability, efficiency, energy, humour, etc., may be 
slightly more successful, though it is, of course, more difficult 
to get an objective criterion of accuracy, and most investigators 
have employed ratings by associates. Burt? obtained coeffi- 
cients ranging up to +787 for certain traits, though the average 
was only +-18; and the present writer obtained an average 
of only -01 when rankings of 10 students’ photographs on 4 
traits were compared with composite measures of these traits. 
It was noticeable that the judgments of the different traits— 
Intelligence, artistic interests, sociability and efficiency—over- 
lapped considerably ; and that the 5 students who received 
the highest average judgments on all traits were all smiling 
when photographed, whereas the 5 with low judgments all 
happened to be caught frowning or with solemn or disagreeable 
expressions. Uhrbrock has published similar results, and 

hornton shows that wearers of spectacles tend to be rated 
especially high in intelligence, industriousness, and honesty.? 

1 Pintner, R., ‘ Intelligence as Estimated from Photographs’. Psychol. 
Rev., 1918, 25, 286-206. eam 
ur, €. L., ‘ Facial Expression as an Index of Mentality in Children °. 

ld Study, 1919, 12, 1-10. A A 

Uhrbrock, R. S., ‘ Estimating Intelligence from Photographs’. Proc. 
Intern. Cong. Psychol. Princeton, N.J. : Psychol. Rev. Co., 1930, 
f Wearing Glasses upon Judg- 


451-452. Thornton, G. R., ‘ The Effect 0 
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appearance, and the majority identified him as the French 
premier. Nevertheless one quarter of all the judgments were 
correct, where one ninth would be expected by pure chance, 
suggesting therefore that people do to some extent conform to 
our stereotypes, or that our judgments do have some slight 
validity. Arnheim, Gahagan, and the present writer 1 carried 
out numerous experiments where small sets of photos of un- 
known writers, politicians, or other persons had to be matched 
with excerpts from their writings, with their professions, with 
short case-studies, specimens of handwriting; records of their 
Voices and so forth, and in almost all of these a considerable 
Superiority to chance success was found, corresponding to 
coefficients of around -4. Yet another technique which works 
well is to get the judges to write free characterizations of the 
Photographees in their own words. Most of the sketches 
appear to contain a good deal of accurate material, and this 
can be proved objectively by asking other judges who know 
the original photographees to guess which sketch refers to 
which person. 

In this field of expression the matching method ? is an advance 
on methods which deal with one trait at a time. But it has 
various weaknesses. Although numerous judges can be used, 
they cannot deal with more than about half a dozen person- 
alities at once. Hence their success is very variable, depending 
largely on the heterogeneity of the particular personalities. , It 
is better for each judge to match several sets of material, 
chosen at random. When matching, say, photographs with 
character sketches or case-studies, it often happens that some 
single phrase in a sketch happens to give the clue. A remark 
about health or stature, neatness, maturity, ete., may allow an 
identification, without any real consideration of the personality 
as a whole or the appearance as a whole. , Other photographs 
in the set which do not happen to fit in with the judges 
Stereotypes may never be matched correctly except by a process 


of elimination. 

1 Arnhei ibe ‚rimentell-psychologische Untersuchungen zum 
Ausdrucksproblem = chal mori 1928, 11, 1-132. END 

udgments of Oceupations from Printed Photographs ’. J. Soc. Psycho 
1988, 4, 128-184. Vernon, P. E., ‘ Can the “ Total Personality ” be Studie: 
Objectively 2? Char. & Person., 1935, 4, 1-10, 

* Cf. Vernon, Bibliography. 
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EXPRESSIVE MOVEMENTS 
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‘taught, and base his interpretations on deviations from this 


model. Movements are much affected also by health or fatigue, 
and by environmental restrietions such as the pressure of 
elothes or furniture on the body or, in the case of writing, the 
kind of pen-nib and paper. 

This does not mean, however, that our non-adaptive move- 
ments are merely specific learned habits. Thus every adult’s 
writing differs from what he or she was originally taught in a 
manner peculiar to himself. Moreover, this individual style 
does not depend only on the finger and wrist muscles and nerves, 
for it reappears if one writes on the blackboard, or with the 
left hand, or with the toe on a smooth sand surface. Hand- 
writing is a particularly useful mode of expressive movement to 
study because it leaves a permanent trace; it is a crystallized 
gesture. But the same conclusions hold good for any other 
mode. How they arise, and how far they reflect either conscious 
traits or unconscious tendencies, is too complex a psychological 
problem to be considered here. Only in rare instances, such as 
obsessional hand-washing, can the clinical psychologist trace 
fairly clearly the origin of any particular movement. And 
this means that the interpretation of any expression is highly 
subjective and uncertain. The obvious deductions—that a 
loud voice shows domineeringness, & flabby handshake weak- 
ness, an illegible handwriting disorderliness, are probably as 
often as not untrue. Many expressions are compensatory. 
Sometimes different parts of the body are contradictory, as 
when a shy and nervous person controls his face and voice but 
gives himself away with his hands or feet. And intentional 
distortion of manner, or playing a part, is not difficult ; many 
People, such as salesman, largely live by it. 

Probably our judgments of temporary emotions or moods are 
reasonably successful, in that they enable us to get on with one 
another in daily life, without too many misunderstandings. 
But, as in the case of the features, it does not follow that more 
Permanent traits can be inferred or intuited. One thing we 
do know from experiment js that there is a strong trend towards 
Teliability and consistency. Allport and Vernon measured the 
Speed, extension, and pressure of a large number of simple 
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variations. This continuous record from both hands shows 
irregularities when highly emotional verbal stimuli are given 
(in this respect it is quite similar to the psychogalvanic reflex), 
or when the subject is under strain, and even more marked 
disorganization occurs among neurotics (cf. Fig. 8). These effects 
are widely confirmed in the literature, though it can scarcely be 
said that they have yet led to a simple and straightforward test. 
Clarke, and Albino? report correlations between neuroticism 
and right-hand, and left-hand disturbances, respectively, during 
word association tests. Several studies of muscular tension in 
young children have been carried out by Duffy,? where the 
pressure of the hand on a rubber bulb was recorded during 
Teaction time and other tests. She claims correlations of 
around -5 with assessments of the children’s excitability and 


emotionality. 


VOICE SPEECH, AND HANDWRITING 


The voice is an interesting form of gesture which happens to 
be made by the throat muscles, and to be audible instead of 
visible. Intonation and dynamics; speed, rhythm, and 
continuity ; pronunciation ; vocabulary and choice of words, 
and style all provide important clues in our reactions to one 
another’s personalities.? Pear 4 has pointed out the rôle of the 
voice in judgments of social class and, in an experiment over 
the radio, found that occupation and age can be guessed fairly 
accurately. Dusenbury and Knower ë showed that emotions 
can be expressed by, and recognized from, tone of voice as 
accurately as from facial expressions. Asthenie or pyknic 


1 Clarke, A. D. B., The Measurement of Emotional Instability by Means of 
Objective Tests. Ph.D. Thesis, University of London, 1950. Albino, R. C., 
‘The Stable and Labile Personality Types of Luria in Clinically Normal 
Individuals’, Brit. J. Psychol., 1948, 89, 54-60. h 

2 Duffy, E., ‘ The Measurement of Muscular Tension as a Technique for 
the Study of Emotional Tendencies ’. Amer. J. Psychol., 1932, 44, 146-162. 

3Cf. the analyses by Newman, S., and Mather, V. G., ‘ Analysis of 
Spoken Language of Patients with Affective Disorders’. Amer. J. 
Psychiat., 1938, 94, 913-942 ; and Moses, P. J., ‘ The Study of Personality 
from Records of the Voice’. J. Consult. Psychol., 1942, 6, 257-261. 

1 *Pear, T. H., Voice and Personality. London: Chapman and Hall, 
931. 

ë Dusenbury, D., and Knower, F. H., ‘ Experimental Studies of the 

Symbolism of Action and Voice, II’. Quart, J. Speech, 1939, 25, 67-75. 
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a number of quantitative measures of language, such as the 
Type/Token ratio—the number of different words over total 
words. One of the most promising is the Verb/Adjective 
ratio, which has been shown by Busemann? to correlate 
positively with ratings of emotional stability. Again Balken 
and Masserman® find the highest proportion of verbs in the 
speech of anxiety neurotics, the lowest among hysteric patients. 
Much investigation needs to be done into the consistency 
of speech measures; it is only too likely that a person’s 
style varies greatly in different contexts and different social 
Situations. 

In the ordinary interview, the interviewer not only has the 
full range of modes of expression to judge from, but can also 
take account of the content of what the interviewee says. The 
fact that interview judgments are often so inconsistent and 
inaccurate, even with this additional data, indicates that 
expressive movements and voice alone are too readily misinter- 
preted to yield any easy test of personality which would possess 
high validity. 

For this reason one would not expect the analysis of hand- 
writing to be as useful as graphologists claim, though it has the 
advantages not only of permanence but also of being more 
Spontaneous and less liable to intentional distortion or disguise 
than any other mode of expression. It has the disadvantage 
that only about a quarter of the population are sufficiently 
fluent writers to have developed a mature, individual, style. 
Graphologists admit that children’s and poorly educated adults’ 
Scripts are much less revealing, though H. J. Jacoby believes 
that even the scribblings of pre-school children are to some 
extent diagnostic. (His book, Analysis of Handwriting,? may 
be recommended for a general review. Allport and Vernon, and 

ell, survey the psychological literature.) j 

e must distinguish clearly between the older graphological 
Systems which attributed significance to specific signs or details 
(such as length of t-bars, height of upper and lower “projec” 
tions’, etc.) and more modern methods based on the work of 
taae nenn, A., Die Sprache der Jugend als Ausdruck der Entwick- 
„Balken, E. Hu and en, J. HL, “The Language of Phantasy * 

- Psychol., 1940, 10, 75-86. 
London : Allen and Unwin, 1939. 
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employed more specific signs. Powers,! also Cantril and Rand ? 
in other matching experiments got moderately good results 
from professional graphologists, and poorer ones from non- 
graphologists. Stein Lewinson è was able to predict failures in 
college work and personality maladjustment with considerable 
success from her analyses of the handwritings of women 
students. Eysenck 4 found that a graphologist could to some 
extent predict the answers of neurotic patients to a personality 
questionnaire, and could match her analyses with personality 
diagnoses written by psychiatrists. Some of the patients 
appeared to express themselves in their writing much more 
completely than others, and some questions were much better 
guessed than others. In a later research,® another graphologist 
(considered to be professionally skilled) entirely failed to 
differentiate the handwritings of a group of neuroties from those 
of normals. There is plenty of evidence, however, that certain 
types of psychotics show strongly differentiated handwriting 
characteristics. This is summarized by Bell. 
_ In conclusion it should be pointed out that modern graphology 
is a highly skilled art and science. The amateur who attempts 
to apply it after reading one or two books is unlikely to give 
diagnoses of any value at all. Many so-called professionals are 
also inept, and there is no ready means of distinguishing the 
bad from the good. The best undoubtedly are able to produce 
very penetrating diagnoses of some (though not all) mature 
writers. But even they are limited in what they can cover. 
or example, handwriting gives little scope for the recognition 
of special talents, or social attitudes. It may chiefly be expected 
to throw light on the emotional structure, conscious and 
Unconscious, of the personality, on character integration and 
Neurotic mechanisms. How completely these are expressed in 
graphic movements we do not yet know. 


$ Powers, E., cf. Allport and Vernon, Bibliography- 
Cantril, H and Rand, H. A., ‘An Additional Study of the Deter- 
don of Personal Interests by Psychological and Graphological 
ethods’. Char. & Person., 1984, 3, 72-78. ? d 
Cf. Munroe, R., Stein Lewinson, T., and Waehner, T. S., A Comparison 
of Three Projective Methods’. Char. & Person., 1944, 13, 1-21. i 
* Eysenck, H. J., ‘ Graphological Analysis and Psychiatry : An Experi- 
mental Study’. Brit. J. Psychol., 1945, 85, 70-81. 
$ Eysenck, H. J., ‘ “ Neurotieism ” and Handwriting’. J. Abn. Soc. 
Psychol., 1948, 43, 94-96. 
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Handwriting pressure and its variations constitute an 
important element in graphological analysis. Usually they are 
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the pen); though almost all of these have the disadvantage of 
interfering to some extent with normal writing movements. 
Such records show marked individual differences both of average 
pressure and of pattern, which appear to reflect the personalities 
of the writers (cf. Fig. 4), and which are characteristically 
distorted in some organic and functional disorders. Both 
Allport and Vernon, and Pascal,! quote correlations of :5 to 6 
between point-pressure and such traits as Energy and Expres- 
siveness. More systematic exploration of the possibilities of 
such apparatus is badly needed. 

‚Two other methods developed out of graphic movements by 
clinical psychologists deserve mention. Mira’s 2? myokinetic 
diagnosis (P.M.K.) requires the patient to draw with each 
hand in turn series of straight, zigzag and circular lines of 
Standard form, in various spatial directions. Half-way through 
each task his vision is blocked, and the subsequent kinaesthetic- 
ally-guided movements naturally tend to diverge from the 
original form. The type of drift is claimed to throw light on 
inner emotional and aggressive trends. Bender’s® Visual- 
Motor Gestalt test gives scope for distorted perception as well as 
inaccurate motor reproduction of shapes. It is based, like the 

inet Memory for Designs, on observation and drawing of a 
number of simple figures. Characteristic disturbances occur ın 
cases of nervous and mental disorder. Similarly Bühler 4 has 
described abnormal features in responses to the Stanford-Binet 

all and Field test among neurotic children. There is little 
evidence as to the diagnostic possibilities of such tests in more 
Normal personalities. 4 3 

tistic productions and literary style might logically be 
Considered here, but are more conveniently postponed to the 


chapter on projective techniques (pp. 180, 192). 


* Pascal, G. R., ‘ Handwriting Pressure : Its Measurement and Signifi- 
Tae Char. & Person., 1948, 11, 285-254. 
83, ra Myokinetic Psychodiagnosis . 
Bender, L., ‘ A Visual Motor Gestalt Test 
Ana Amer. Orthopsychiat. Ass. 1938, No. 
Suttell, B. J., The Bender-Gestalt Test. 
Tatton, 1951. : ae 
Ei Bühler, C., ‘The Ball and Field Test as a Help in the Diagnosis of 
motional Difficulty’. Char. & Person., 1938, 6, 257-278. 
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Most psychologists who have written about individua. value 
Merrill-Palmer, or Performance tests, have stressed t xe social 
of observing the child’s reactions to difficulties and his 
behaviour in the testing situatio 
contain rating 


A nly 
scales for the personality traits which commo. 
emerge. Goodenough ? fou: 


a 
distractibility i as rated by different test E 
» are fairly reliable, at least over short Defere nt 
irely differently on differ” 
On the next page is Ba 
ng sheet which the w 


o 
the vertical lines in or ” 3 
express rapidly the approximate standing of the subjec Ja 
each quality. Additional items may be added for particu! 
tests, e.g. the Porteus Mazes. 


1 Burt, C. L., The Youn Delinquent. 
Press, 1925. e Sy 

* Goodenough, F. L., ‘ The Emoti 
during Mental Tests’. J. Juven. Res. 


R don 
London : University of Lon 


Haren 
ional Behavior of Young Childr 
» 1929, 13, 204-219. 
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Although the tester can base his judgments on the full range 
of expressive movements, including conversation, the method 
has serious limitations. The subject is in an unusual situation, 
and there is no guarantee that he will react to the problems set 
before him in the same manner as he habitually reacts in daily 
life. Particularly in testing adults, it is difficult to produce a 
natural, spontaneous, atmosphere. It is only too easy, also, 
to Jump to conclusions from slight evidence, to misinterpret 
facial, vocal, or other expressions, or to be biased by chance 
resemblances to acquaintances, by general like or dislike of the 
testee, or by hasty first impressions, etc. On the other hand 
the psychologist usually has the advantage of coming fresh to 
each testee, uninfluenced by previous knowledge of him. 
Clearly everything depends on his impartiality, experience, and 
Intuitive skill. 

In one experiment by the writer, 25 students were observed 
by 8 testers (2 of them quite inexperienced) in three different 
performance test sessions. The average agreement between 
their impressions, determined by the matching of personality 
sketches, was represented by a correlation of -72. They also 
rated the subjects on Practical Intelligence, Quickness, and 
Impulsiveness, Extraversion-Introversion, and Emotional 
Stability. The average inter-correlation of :56 shows a con- 
Siderable amount of variation either in the subjects’ behaviour 
or in the testers’ interpretations. Nevertheless the correlations 
of the summed judgments with composite measures of the same 
traits averaged 50 (43 to -61), showing a very promising 
validity. 

The method can be made still more useful by including a wide 
variety of tests which will stimulate the subjects to display more 
Significant behaviour. Ordinary individual intelligence, educa- 
tional and performance tests are hardly provocative enough. ‚It 
can also be developed in such a way as to yield quantitative 
Indices at least of certain traits, and thus reduce the amount of 
Subjective judgment. A good example of this is the Q or Quality 
Score in the Porteus Mazes,? based on the number of times the 
Subject crosses or touches the printed lines, cuts corners, starts 
to go up wrong turnings, lifts his pencil, ete. Porteus states 


+ Vernon, op. cit., p. 49 
* Porteus, $. D., "Qualitatioe Performance in the Maze Test. Vineland, 


N.J. : Smith Printing House, 1942. 


ENERAI OBSERVATIONS 
RATING SCALE FOR QUALITATIVE 
g DURING TESTING AND INTERVIEWING 


Examiner ......... 
Ys 
eee restless, unable to keep still Impulsive 
Quick and vivacious _ an 
ious 
Calm and deliberate Cauti 
Inert and listless Inhibited 


Poses, motor attitudes. 
Tics .... 
Fiddling with material. 
Peculiar expressions. 
MOVEMENT 


Fluent and graceful 
Accurate and well-controlled 


Angular and awkward 
Clumsy 


Quick stride and movements 
Slow stride and movements 


PHYSIQUE AND BEARING d 
Impressive in bearing Healthy looking, well developed and nourishe 
Satisfactory impression | 


Unimpressive Unhealthy, feeble physique 
Forceful, efficient, energetic, upright posture and gait 
Slouching gait 


eak, inefficient, movements and bearing 


Plump (pyknic)proportions 
Well and symmetrically proportioned 
Thin (asthenic) 


PERSONAL AP: 


Florid 


Pale 


PEARANCE AND EXPRESSION 


Attractive and good-looking (positive reaction) 

Pleasant Sensual 

Uninteresting, indifferent attractiveness 

Ugly and repulsive (negative reaction) Effeminate 
| Strong expressiveness of face and gestures Frank 

Expressionless | Secretive 

Quick ana strong sense of humour Cheerful, optimistic 
Slow but sure | Depressed, melancholy 
Unable to see humour 

Mature, 


Serious, philosophica] Excitable, irritable 
Even-tempered 
Calm, phlegmatic 


Immature, childish 


PERSONAL CARE 
Fastidious in dress, 


and cleanliness 
unkempt 


SPEECH ones 
Voice resonant, pleasing, well-modulated aoe ates 
| Hard, harsh, pinched 


; t 
| Expresses meaning directly, rommaa with facility 
Unable to express himself, ungrammai Ve 


Garrulous, over-talkative Brilliant in talking, wide vocabulary 
Rather voluble | 


lid, narrow vocabulary 
Seldom speaks of own accord Dull and sto! 
Reticent, taciturn 


SELF-ASSERTION 
Pompous and overbearing 


vo 
Complacent Daciei 
Self-confident and possessed Wavering 
Self-critical and deprecatory tible 

mbarrassed, bashful, self-conscious | Contrasugges 
Anxious, apprehensivo tuggestible 
Submissive, retiring 3 


CO-OPERATIVENESS it 
W Bere in every respect; enters into spir! 


Reserved and formal n 
Sonstramed and suspicious, outside the situatio 
urly and hostilo 
application 
„erupulous, punctual and regular in attendance and ap 
Industrioug 


Easygoing, indifferent 
azy and irregular 


ALERTNESS AND CONCENTRATION 
Intelligentiy attentive, wide-awake 
Oncentrated 


Absent-mindea 
Easily distracted, inattontivo 


TEST REACTIONS: PLANNING 
Analytical 
rious but unsystematic 
Trial and error 
Aphazard 


SNC TION emarks 
Wild ana unrestrained emotional behaviour and r 
1 and childish reactions, capricious 
Some loss of self-control and overt emotion 
‘Tous and unconcerned 
Serious, Philosophical 
Repressed and inhibited 


Profits by past experience 


Repeats same mistakes 


SPnorar, CHARACTERISTICS 
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i f two 
i and adult criminals make an average o is 
ee en aan such errors as normal controls, but = 
se not yet been independently confirmed. en a aaa 
example, finds few differences between neurotic nae ahaa 
adults, though different types of neurotics show Soa 
acteristic differences, particularly in speed of maze per REEE oo 
A series of measurements indicative of emotiona sea scales 
psychomotor and performance tests, together with rati g, eating 
were devised by Biesheuvel? and his associates in pee 
South African Air Force pilots during the last war. Itis ine 
that these helped in choosing men with suitable ana For 
as well as aptitudes. However, the evidence is equivo 2 gre 
similar methods tried out in the USAAF gave very railing 
correlations of -1 to -2 with the criterion of passing ps a 
pilot training. In fact they were scarcely superior to ms oe ted 
based on appearance alone.3 Similarly M. D. Vernon 4 a ren 
to develop methods of gauging liability to breakdown 35) at 
stress. Subjects worked at a dotting machine (ef. Re a t 
high speed for several minutes, and at other tests like = in 
induce strain, and records were made of their pet i 
Successive half-minutes, From the upward or downwar' eee 
irregular trend, and from qualitative observations, a ar ae 
subjective rating of stability was reached, But no good evi bl a 
of the predictive value of these “trend tests’ is a 
Other more objective techniques are described in Chap. VI. H 
Even in group Paper-and-pencil tests of abilities hee 
Some scope for individual differences in manner of performance 


r 
which may express the testees’ personalities without maa 
being aware of it. Some people check multiple-choice Re rd 
only when they are certain, others guess more wildly. Guil an 
and Lacey 5 have shown that a consistent error-score fac d 
(distinct from ability at the tests as such) may be extracted» 
* Foulds, G. A., ‘ Temperamental Differences in Maze Performancr 
Part I. Characteristic Differences Among Psychoneurotics’. Brit. 
Psychol., 1951, 42, 209-217. nd 

* Biesheuvel, S., ‘An Observational Technique of Temperament @ 
Personality Assessment, 


Nat. Inst. Personnel Res. Bull., 1949, 1, a 
* Guilford, J. P., and Lacey, J. I., Printed Classification Tests. AT 
ae Forces Aviat, Psychol. 


4 Cf. Vernon and Parry. 
cit, 


Expressive Movements 67 


eat they term carefulness >, In her research on student 
ction, Himmelweit 1 found such an error score to add 
aed to the prediction of subsequent degree results. 
ee? this might be useful too in selecting grammar 
tried. pupils by objective group tests, but it has not yet been 
E Te une pupils and their teachers would, of course, have 
ain: ene in ignorance, otherwise the very careless might be 
In to modify their test behaviour.) 
indie conclusion it would seem that the evidence for objective 
jud & is somewhat less favourable than that for subjective 
Bu = ents of manner of performance by experienced testers. 
much more research is needed. 


eld, A., ‘ Student Selection—An 


ıHi } 
Himmelweit, H. T., and Summerfi 
. Sociol., 1951, 2, 59-75. Cf. also 


R 7 

paberimental Investigation, II’. Brit. J. 

Intellie W. M., “A Study of the “ Caution ” Factor and its Importance in 

ee Test Performances’. Amer. J. Psychol., 1924, 35, 368-386. 

UA nal T E., ‘ Personality Differences in Intelligence Test Performance ’. 

eaghbl. Psychol., 1925, 9, 230-255. Fruchter, B., ‘Error Scores as a 
ure of Carefulness’. J. Educ. Psychol., 1950, 41, 279-291. 


V 
Simple-Behaviour and Cognitive Tests 


qp is no hard and fast distinction between the indie 
tests of the previous chapter and many of the eine 
described below such as oscillation, will-temperament, dot at 
machine, and the group observation methods. However, n A 
of the tests in this and the subsequent chapter were devise a 
measure directly, or to sample, particular traits or types 


: d 
behaviour. It will be apparent that early workers in the fiel 
of objective tests (probably influ 


enced by the highly analytical 
trend of German experimental psychology) resorted chiefly ly 
very simple Sensory, motor, or ideational tests. Particulari 
in Britain, the development of mental testing was domina en 
by C. E. Spearman, and most of the work on personality, ape 
from that of Burt, was restricted by his narrow views ‘ 


. . n 
temperament. He laid it down that there were three mai 
factors or dimensions 1: 


P—perseveration, the tendency to inertia, or hang-ov 
effect, in mental Processes, as contrasted with = 
ability to switch quickly from one process to anothe ; 

f—fluency or quickness and richness of mental association’ ? 

o—oscillation or Variability in the performance of any tasK- 

The very elementary tests of these factors, together with 

the analogous temperament tests constructed by Downey 

in America, were generally unreliable and highly specific, 

They showed little overlap with one another, and they h# 

scarcely any bearing on the personality qualities of everydaY 

life. Thus the majority of more recent tests, as shown IR 

Chap. VI, have approximated more closely to natural samples 

of behaviour, 


{ z 
ae also, of course, Webb’s W or will-character factor, but vB 
Sn ased on Tatin; , not tests. A useful summary of the work of ory 
Spe man school is given by Wynn Jones, L., An Introduction to The 
Tactice of Psychology. London : Macmillan, 1934. 
68 
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SPEED AND FLUENCY 
Me first question to ask is whether some people are consist- 
o 3 quick or slow in all their actions and thoughts. In other 
oa s, is there a factor of personal tempo, possibly related to 
TEN ert-introvert temperament, which could be measured by 
a time, tapping or other dexterity tests, or speed of 
writ ing, or quickness at intelligence or other tests? The 
A has discussed the evidence elsewhere, and has shown 
and considerable consistency does exist, both among manual 
to cognitive tests, but that to a greater extent speed is specific 
me particular kinds of activities. For example, different 
tan aes of reaction time correlate very closely, also different 
mae tests ; but reaction times give much lower correlations 
nor tapping. The same is true when records are made of 
as ao speed of movement, in walking, speaking, writing, etc., 
ect from maximum speed. Thus Rimaldi repeated and 
est: nded Allport and Vernon’s investigation by gige A 
eal of normal speed to a group of students. He analyse i 
ty ts into a series of multiple factors representing di eren 
Ye of speed. But in fact almost all his inter-correlations were 
eee so that it would be justifiable to recognize & eo" 
Ricci, al factor, together with group factors for quickness & 

he ed o activity. wE 

er, Notcutt,? and the reser 

En correlations between oi speed or fluency teei ana 
clai aversion among children and adults ; and Kretsc ne 

Er quicker movements and reaction times m PY nee 
Point in asthenies has had some confirmation. Himme Be D 
accu; s out that most dexterity tests can be scored. for sp EY 
or racy, and that these two measures are largely, indepen i 

even negatively related. When a choice 1s given, hysteri 


a Abilities. London: N 


nt writer have obtained 


{ethuen, 


iy, 
1850. aE E., The Structure of Humar = 
2 p; “t also Allport and Vernon, ibliography- N 
2ap ggg H. J. A.‘ Personal Tempo’. J+ Abn Soc. Psychol., 195 
xtrovert Responses to Certain 


03. 
1308. ~ 
Sch 
Test si egler, R. A., ‘A Study of Introvert-E: 
i i rant utt, B. 
* Persevaations 5. peach, Coll. Contr. Educ. 361. Noteutt, ©» 
4 Himaction and Fluency ’- Brit. J. 
Temp Melweit, H. T., ‘Speed and ‘Accuracy 


Pe: 
ai >.” Brit. J. Psychol., 1946, 36, 
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ics 
patients tend to do better on tests scored for speed, dysthymi 
tests scored for accuracy, Sy! TA 
FE the intellectual, as distinct from the motor, field it is ral 
difficult to separate speed from ‘ power’, i.e. from ae ae 
intelligence, However, such tests as those mentione 


Writing as many words as possible in a minute beginning 
with the letter S; or words ending with ‘ tion ’; Pe 
riting as many names of animals, birds, plants, 
possible, in a minute each ; J. of 
riting adjectives to describe a house; listing names 
round objects ; 
iving associations to inkblots ; in 
Speed of free word association, or numbers of. words 
continuous association ; 


Suggesting objects which could be inserted at a certain spot 
in a picture, 


Normal Speed of reading, and 
Positions, or in building word 
also been used; and sever. 


o surgency among normal subjects, Probably this is 


o (cf. p. 5), for no one else a 
approached this figure. Petrie 4 found that f tests do no 
differentiate between hysterics and dysthymics, and an extensive 
and thorough Tesearch by Rogers 5 gave negative results. He 


Mental Testing. London: University of 
enson, W., Studman, G. L., et. al., * Spear 
istry’. Brit. J. Med. Psychol. 1934, 14, 101-135. 
Thurstone, L, L7 “Primary Mental Abilities > Psychometr. Monogr., 1938, 
1. Rogers, C. A, 4 Factorial Study of Verbal Fluency and Related Dimen- 
ee, Personality. Ph.D. Thesi 
cut, 


esis, University of London, 1952. 
* See Eysenck, Bibliography, 5 Op cit. 
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Se a large variety of fluency tests and personality ratings 
ae ee group of 14-year-olds. His oral fluency tests 
an show some relation to ‘surgent’ traits, but his written 
ae pene atall. It has been shown by Bousfield and Sedgwick + 
en e production of responses in a typical fluency test tends 
a ow an exponential curve, and the constants for such a 
= e might be identified with the total reservoir of the subject’s 
aep onses and with their rate of exhaustion. Rogers found no 
ee nce that these more analytic fluency measures gave any 
er correlations with personality traits. 


INTELLIGENCE AND PERSONALITY 
] intelligence itself overlaps 


d moral sides of personality. 
dies of Genius,? the gifted 


Nee mentioned in Chap. I, general 
some extent with the emotional an 
pie, in Terman’s Genetic Stu p 
ice tended to have a wider range and better quality of 
ite than average, to be superior on character tests and 
hi mee and in emotional adjustment (though those with the 
an He 1.Q.s of all were somewhat more liable than the rest 
dete low difficulties of adjustment). Conversely, mental 
eres are certified on grounds of social inadequacy and 
qu el instability as well as of low intelligence, and these 
qualities are to some extent associated.® Among adults, again, 


l i 3 . . 
eng igence is quite clearly connected with cultural interests, 
small positive correlations have been found with such 

tolerance, liberal, or progressive 


atti ; à A 
titudes as internationalism, 


Opinions, ete.4 
pathe extremely tangled situation exists in the field of psycho- 
sin ology, and we can review it only rather superficially, 
ns its bearing on the assessment of normal personalities is 
of poousfield, W. A., and Sedgwick, C. H. W., “An Analysis of Sequences 
patrioted Associative Responses », J. Gen. Psychol., 1944, 80, 149-165. 
k ee was first put forward by Thomson, G. H., and Thompson, 
Vocal, Outlines of a Method for the Quantitative Analysis of Writing 
A Seeds Brit. J. Psychol., 1915, 8, 52-09. oy 
rd, Calif. : i ity Press, s 
3 Stanford Universi ion of the Vineland Social 
6, 283-293. O’Connor, N., 


25, 205-211. 
Political Attitudes °. Amer. 
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limited. On most intelligence tests the scores of neuroties oe 
little if at all from those of normals; but psychotics, espe are 
those of the organic type, are distinctly poorer. — Perform ead 
declines also in aphasic conditions, or with brain injury, ong 
to some extent with age; (indeed much of the deficit Hoa 2 
psychotics may be an age effect). But this deterioratio 
certainly more marked on s ki à lized ’ 
is usually least on tests of what Cattell 2 calls crystalli Ei 
intelligence, such as Vocabulary, whose level represents, z of 
Were, what intelligence has achieved in the past. Tes 16 d 
‘ fluid ’ intelligence, which are most affected, include spect : 
i > ‘traction or ee, 
memory (e.g. of digit series); 
"es ee devised whic h 
old’ and ‘don’t hold’, om 
the Shipley-Hartford, and th 
Unfortunately such indices are not very 


contrast scores on tests that ‘hi 
example, the Babcock-Levy, 
Wechsler-Bellevue,3 
reliable, 


they give very low correlations with one another in individua 
diagnosis, (It is seldom reali i 


atistically than are the separ 
erived; also, that two such scor! 


» J. MeV., Bibliography, and Brody, 
elligence Tests in Psychosis ’. 
sychol., 1942, 19, 215-261, 


y T asurement of Adult Intelligence’. Psychol. 

Bull., 1943, 40, 153-193. i r 

* Babcock, H., and Levy, L., Revision of the Babcock Examination fo 

easuring Efficiency of Mental Functioning. Chicago, Ill. : Stoel 

: ga A Self-administering Scale fo Measuring Inte” 

lectual pone and Deterioration 2 F Psychol., 1940, 9, nz F 

5 Measureme 7 imore, Md. : 

Williams and Wilkins, di 5 on ary oe 
Anse 
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type of deterioration differs in different psychopathological 
conditions, and elaborate systems of diagnosis have been based 
on discrepancies between the various Wechsler-Bellevue sub- 
tests, concept-formation tests, particular Stanford-Binet items, 
and so forth. Rabin and Guertin ® have surveyed the Wechsler 
literature, and concluded that there is little or no prospect for 
the success of any mechanical system of differential diagnosis 
based on profiles of performances on cognitive tests. Yet at 
the same time it may well be true that an experienced clinical 
Psychologist can, from studying a patient’s pattern of responses 
to diverse tests, obtain useful insights into his mental condition, 
and thus assist in making the psychiatric diagnosis. In spite 
of the enormous amount of material available in clinics and 
mental hospitals, and the hundreds of investigations that have 
been published, we still seem to have no thoroughly established 
quantitative indices. The contradictory and confusing results 
arise, no doubt, partly because patients in any one psychiatric 
group are never homogeneous and entirely distinct from other 
groups, partly because of the influence on test performances of 
Poor co-operation during mental illness, and partly because of 
the difficulties of making allowance for age and previous 
education, 
i The same conclusion holds for v: y or; e J 
different tests. Many psychologists working in Child Guidance 
linies believe that a wide scatter of passes and fails on the 
tanford-Binet or Terman-Merrill scales is indicative of 
maladjustment ; and numerous measures of variability for this 
or for the Wechsler scale have been proposed. Jastak * has 
discussed the evidence, and concluded that Binet scatter 1s not 
lagnostic, although certain items such as Digits Backward 
and Memory for Designs tend to offer special difficulties to 
abnormal children, Unevenness in the performance of recruits 


ad aus intelligence, mechanical and education OH was 
tudied duri ould not be found to have any 
; uring the war, and c ive Matrices test 


Significance for personality. In his Progress! 


ariability or range of scores on 


a of Rapa i afer, Bibliography- 

® Rabin, rita a a on ONL, "Research with the Wechsler- 
Bellevue Test: 1945-1950 °. Psychol. Bull., 1951, 48, 211-248. J 

Jastak, J., Variability of Psychometric Performances n Mental Diagnosis. 

New York: J. Jastak, 1934. ‘ Problems of Psychometric Scatter Analysis ’. 


Psychol. Bull., 1949, 46, 177-197. 
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(1938), Raven 1 provides an index of unreliability or Ber ty 
of score pattern; this too appeared to bear no relation 4 
neuroticism. Other kinds of variability will be considere 
later when we come to Spearman’s oscillation factor. F 
Is the difference between Vocabulary or v-tests, and E 
abstract intelligence or spatial (g and k) tests of any value a 
personality assessment ? Several studies suggest that scores 2 
Kohs Blocks, Porteus Mazes, Picture Completion, also Ener 5 
sive Matrices, are relatively high among delinquents, ae 
lowered among neurotics. For example, Earl ? claims that 4 
profile of scores on Binet Vocabulary, a verbal absurdities ae d 
Kohs Blocks and Dearborn No. 3 Formboard, is particula gy 
useful ‘(with the assistance of qualitative observations) 5 
diagnosing instability and social inadequacy among an, 
Himmelweit,3 however, finds the ratio of Vocabulary , 
Matrices scores to relate to introversion, dysthymies obtaining 
larger ratios than hysterics. y 
Another possible lead is provided by the concept-formation 
or sorting tests of Goldstein and Scheerer, Weigl, Vigotsky, 
Berg, and others.4 Brain-injured and schizophrenic patien 
it is claimed, can cope with these at the concrete or perceptua 
level but cannot realize or formulate abstract principles © 


classification, Actually no one has proved that these ang 


anything more than rather unreliable tests of g, though i 


conceptual appro: i 
the typical sorting test the sub 


objects which can be classified in various ways (for example, 
wooden blocks of different shapes, sizes, and colours), and he 1 


t or to pick out those which are similar to 
certain standard objects. If he has mastered one principle © 


1 Raven, J. C., Pro; ressive Matri £ i Peri 
? Earl, C. J.C 8 atrices. London: Lewis, 1 


+ “A Psych D A . Psychol., 
1940, 85, 42 ’ Sychograph for Morons ’, J. Abn. Soc yi 
SER 


elweit, H. T.,‘ The I 
of Temperament ”. Char. & P: 
* Goldstei 


p ; : Psychological Corporation, 1945. Hanfmann, E.» 
A Study of Personal P al Performance". Char. & 
ple Objective pea 
ing’. J. Gen. ., 1948, 39, 15-22- 

° Cf. Hanfmann, op. cit. Gen. Psychol., 1948, 
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classification, e.g. by colour, he is given a new and more complex 
problem, and the process is repeated. Such tests are thus 
considered to involve not only the capacity for abstracting and 
generalising, but also flexibility in shifting to fresh principles. 
Whether or not this is true, flexibility and its converse—rigidity 
or inertia of mental processes—have played an important part 
in the development of temperament tests. 


PERSEVERATION AND RIGIDITY 


f Some of the earliest tests were based on sensory functions, 
Or example, slowness of dark adaptation, or low speed of fusion 
of colours on a colour wheel. It was claimed that melancholic 
patients showed greater perseveration at such tests than manics, 

ut this was not borne out by later research. (Nor, incidentally, 
2 any connection been found between any perseveration tests 

nd the perseverative tendencies that occur in certain types of 
2 eychosis) More popular were motor perseveration tests, 
N ere the subject writes certain letters or figures at maximum 
es: and then reverses the letters or writes them in some 

nusual way. The decrease in speed of performance 1s taken as 
a measure of the perseverative effect of the normal performance. 


xamples include : 


Copying prose with and withou 
_ crossing the t’s ; K 
Doing simple 4-rule sums, but substituting + for — and x 
_ for + signs, and vice-versa ; 
Writing ss. . for half-minute, 22. 
SeSe. . for 1 minute ; 
Writing ee. . . forwards, and €e.. 
‘wrong’ end, then alternating. 


or drawing, which involves the breaking down of estab- 
shed eye-hand co-ordinations, is also sometimes regarded as 


ek There has been much dispute as to the best De 
er ent, or hang-over ; all the scores tend to be affected by 
is lal speed of writing or copying, unless appropriate correction 

Made! After correction, the correlations between different 
J. C., ‘ The Influence of 
Brit. J. 


t dotting the i’s and 


. for half-minute, then 


starting at the 


1 Cf. Walk ; 
in: er, K. F., Staines, R. G- and Kenna, J. | I 
Serr, Methods Upon Score in Motor Perseveration Tests '. 


Psychol., 1945, 85, 51-00. 
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tests seldom reach statistical significance (unless the tests te 
very, similar), and they vary so widely in different researche 
using the same tests that it is extremely dubious ‚whether any 
genuine factor.of perseveration can be said to exist.! Lankes 
and Bernstein 2 obtained correlations as high as -5 between 
pooled test scores and self-ratings or assessments of persevere 
tive behaviour in daily life, but correlations with measures 0) 
introversion or of depressive tendency (with which perseverat 
has been identified) have generally been negligible. An entire y 
different claim. was put forward by Pinard,3 namely, that bof 

i perseverators are more difficult, 
unreliable, and lacking in self-control and perseverance than 
subjects obtaining moderate scores. In other words, persevera- 


firmation of this, 


ore recently both Walker, Staines, and Kenna and Cattell ® 
have stated that the alternatio ’ 
ess, but that ‘ creative effort 
reak down some well-established 
—) are more consistent, and that they 
sition rigidity’, Cattell obtained 
“4 between a battery of such tests 
r Integration, ep ly 
development is more flexible an 
He argues, too, that there are 
emperamental differences between Nordics 
So far no other evidence of the 
seems to have been published. 
` C£. Jasper, H, H., ‘Is Perserveration a Functional Unit Participating in 
3 Tocesses’. J. Soo, Psychol., 1931, 2, 28-51. Notcutt, B.» 
op. cit., p. 69, p 
B, quefseveration” Brit, J. Psychol, 1915, 7, 387-419. 
Bernstein, E., ‘Qui ; ja Hook 2 ogr. 
Sum, 1524, N o „üicknes and Intelligence ’, Brit. J. Psychol. Mon is 
t i NE / , 
Som u Tests of Perseveration’, Brit. J. Psychol., 1982, 
ba Cattell, R. B., ‘ The Riddle of Perseveration’. J. Person., 1946, 14, 
ë Walker, K, F., Staines, R. G., and Kenna, J. C., * P-tests and the 
Se of Mental Inertia °. Char. & Person., 1943, 12, 32-45. Cattell, 
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Yet another conception of rigidity is put forward by Lewin ” 
and Kounin,! who regard it as the main characteristic differ- 
entiating the personalities of feeble-minded from normal 
children. When given a choice of drawing tasks they tend to 
Persist longer at a single task, where normal children prefer 
a change. Perseveration was also demonstrated at card-sorting 
tests, and lack of flexibility on a concept-formation test, 
especially among defective adults. But how far such tests 
Tepresent a consistent trait of personality, or how far they 
merely reflect lack of intelligence, has not been investigated. 

Luchins 2 suggests that inefficient methods of education may 
Produce rigid rather than flexible minds, and he has devised a 
number of paper-and-pencil concept-formation tests. Subjects 
nTe given a series of problems and find that these can be solved 

y the application of one principle. Half-way through the 
Principle alters, but the original ‘set’ may delay, or entirely 
Prevent, them from realizing this. Note that the interfering 
Set in such tests is established temporarily during the course of 
he test, whereas in perseveration tests it consists of a fully 
®utomatized habit. An investigation by Oliver and Ferguson 
SUggests that the former largely depend upon g, whereas the 
ie do involve a separate factor. Frenkel-Brunswick ores 

at highly prejudiced or ethnocentric persons tend to be rigid 
°r intolerant of ambiguities in the perceptual sphere, ani 
proeriments by Rokeach with a problem similar to Luchins’s 
yvide some ra ight support for this. 

uchins’s rn en a ee aaiderable resemblance to some 
the tests, described later, which have been proposes) = 
Ineasures of suggestibility. The notion of interference = 4 
pe a great variety of psychological phenomena, for ae a 
Tetroactive inhibition, in positive and negative transtcr, 


5 i + . y- 
min gain, K., A Dynamic Theory of Personality. New York : Men 
Pers 1985, Kounin, J. S., ‘ Experimental Studies of Rigidity i iny 
is sta on., 1941, 9, 251-282. Their topological theory of perona X ee 
Evalusey” criticised by Werner, H., ‘ The Concept of Rigidity : 

ion’, Psychol. Rev., 1946, 58, 48-52. Dh 
“pichins, A. si nee Methods of Studying Degrees of Rigidity 
orson., 1947, k 
Rig ieo: Ken G. A., ‘A Factorial Study of Tests of 

s ‘Ade Y’. Canad. J. Psychol., 1951, 5, 49-59- 
Personala T. W., Frenkel-Brunswick, E., € 
ality. New York : Harper, 1950. 


of 


t. al, The Authoritarian 
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pars a e 
memory for incompleted tasks,! in conditioning, ana fare: 
sets observed in psychophysical experiments. In wae but 
there are, no doubt, individual differences in fest oe oe 
it seems extremely unlikely that they all depen Be: 
and the same trait.2 Nor is there any good i agai; 
any of the tests we have mentioned reflect any ori 
important trait. Even the traditional p tests are so art 
to the subjeets, so troublesome to give and score, 


3 ke 
such dubious validity, that no psychologist seems to ma 
any practical use of them, 


OSCILLATION AND VARIABILITY 


G in a large 
Fluctuations of performance have been measured in a 


3 5 . PP sums, 
number of tests such as reaction time, simple addition 
cancellation, muscle tone, 


ely from their lack of experience in doing 


oe her 
To measure oscillation in addition and ot 
similar tasks, the subject 


tick, say every 15 seconds, 


tion tests and a 
else were wholly specific, 
tests i 


id, are given by ‚Werner, op. eit., and Fish: 
in Research Deal; 3 


al 

Particularly in the abnorm: 
ing with Pers nalit: i 

342-351. ersonality Ri 


er, S., * An Overview of Treno 
gidity’. J. Person., 1949, 
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or z wets 

rn variability should be measured, but whatever the 

a a ano between variability at different (simple) 
generally so low as to cast doubt on the existence of 


any consi : 3 
teas pace trait.1_ Nor is there any good evidence that such 
iagnostic of emotional instability in general. One of 
found a correlation of 


t 
Be enle is that of Walton,? who 
here Tea oscillation among schoolchildren on 4 tests, and 
with eg of steadiness of character. In another research 
entiation bet ypes of variability, Connor 3 found some differ- 
steh e mest maladjusted and normal children, but no 
oon with tests or ratings of instability among normals. 
promising results obtained with more complex and 


realisti 
stic tests are described in the next chapter. 
rsals that occur when 


A 
bs Shy type of fluctuation is the reve 
Eare, ambiguous perspective figures like the Necker cube, 
extraverts F or shadow-windmill. McDougall 4 suggested that 
that this ave a slower rate of reversal than introverts, and 
alcohol rate is decreased by an extraverting drug such as 
. While there is some evidence of these tests differ- 


entiatj à 
ing manic from schizophrenic patients, they fail to 
and several 


disti 7 
Se ep ean and dysthymic neurotics ; 
extraversion-in ow no correlations with self-rating tests of © 
eier ntroversion among normals. — 
xceptions to our general condemnation of sensory and 
k and his 


Motor 
tests are provided by the work of Eysene 
rtain tests of muscular 


collabo 
a Tee They have shown that ce 
vs, Ba cn give quite promising correlations with normality 
e asso oticism. Presumably, good muscular control tends to 
Perform, en with emotional stability. Thus neurotics often 
rce c adly on the Track Tracer and the O’Connor Tweezer 
Gairen E R., Variability in Human Task Efficiency. Ph.D. Thesis, 
2 Wali 3, of London, 1950. 
BD locas Differences in Amplitudes of Oscillation 
1939, ‘onnection with Steadiness of Character > Brit. J. Psychol, 
32 80, 86-46 
1 Traits upon Intelligence 


Conn, . 
Test Perfon D. V., The Effect of Temperamenta 
È: rmance. Ph.D. Thesis, University of London, 1952. 
“The Effects of ‘Alcohol and Some 
iti », Med. Res. Counc. 


* Some 1 


Guilt : 
ord, J. P., and Hunt, J. M., 
_Extroversion ’. 


[3 
of M 
cD 
Psychol ougall’s Theory of Introversion 
+» 1982, 26, 324-332. 
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R hat 
Dexterity tests. Again, N. O’Connor and Tizard ! found t 


adau 5 loy- 
the Heath rail-walking test gave the best indication gt en 
ability ° or “work success’ in a large battery of tests ap 


i ce & 
to feeble-minded youths, This test? measures the distan 
subject can walk alo 


record is played reiter 
falling forwards . ne, 


À d the 
neuroticism factor have been obtained among adults, an 
test has some diagnos 


thenics, 
Form vs. colour 
S of coloured sh 


. to 
of details), Other tests ais 
apacity for Spreading the attention, as contra res; 
\ ntratin ingle task. The Gottschaldt figures 
in which the subject has to »: i 


t Ku : itary 
Implicati ©; R- ‘ Clinical Significance of Motor Defect, with Milita 
En as a Sr: J. Psychol., 1944, 57, 489 499. k: Appleton 
Century, logs,” YPnosis and Suggestibility. New York: 
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was that of Thurstone,! in which several factors were isolated, 
including: Perceptual Closure, Flexibility with Configurations, 
Susceptibility to Visual Illusions, Oscillation of Reversible 
Perspective Figures, ete. But there is no confirmation for his 
Suggestion that some of these factors might differentiate 
successful leaders or administrators, or good and bad readers. 


JUNE DOWNEY’S WILL-TEMPERAMENT TESTS 


This series of tests 2 had a considerable vogue in the 1920s, 
but the results were as unsatisfactory as those of p and o tests, 
and they are now regarded as little more than a psychological 
curiosity. A dozen traits were supposed to be measured by 
elementary tests based on handwriting, which were given In 
8roup form or, better, individually. For example : 


Speed of Movement.—Tested by normal speed of hand- 
writing. yi 

Flexibility —By ability to disguise writing, and to copy a 
model rapidly. p 1 

Care and Accuracy in Detail.—By time spent spontancously 
on disguises and accuracy of copying. 

Impulsiveness.—By tendency. to write larger and faster 
under distraction, e.g. with eyes closed. 


The tests were in fact based on a careful study of graphological 
oy tems, and Downey herself found that the pattern or profile 
of scores on the various tests provided an illuminating picture 

er subjects’ personalities. However; in numerous researches 

© Separate scores were shown to have low reliability coefficients 

ot around ‘5, and little or no agreement was obtained between 
se Scores and ratings by acquaintances on the relevant traits. 
Ossibly our criticism of the tests described in this gheni 

Ay seem unduly harsh, because their authors did not usua. y 
„Dect them to be diagnostic of everyday life personaly 
qualities, Rather they were concerned with fund 

Mperamental tendencies. We would say, however, that the 


“tempt to measure inborn factors was a misguided one. Most 


a 5 
Thurstone, L. L., A Factorial Study of Perception. 


Chica, 
go Universi 3 
2h, ity Press, 1944. r ; N.Y.: 
Wo Owney, J. E., The Will-Temperament and its Testing. Yonkers, 


"ld Book Co., 1924. 


Chicago, Ill. : 
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psychologists nowadays admit that intelligence can be tested 
only as a product of heredity and environment. Similarly, a 
the field of personality we can never really separate what ia 
innate from what is acquired, and-even if we could test t E 
former it would scarcely help us in any of the practical problem 
of predicting people’s behaviour, 


VI 


Miniature and Real Life Situation Tests 


Ines to obtain more realistic samples of behaviour, which 
Regal geld quantitative scores, the psychologist is usually 
Ike simplify and restriet the testing situation. This may 
react t onsiderable artificiality ; hence subjects may in fact 

o ‘miniature’ situations very differently from the way 


the z E 3 
y would react in more natural circumstances, Thus it very 
objective personality 


depending on slight 
in the attitudes of 


RER half of this chapter, few if any © 
Bence in the same straightforward manner as 
mentio and special abilities. The observational methods 
Conse ned later in the chapter are less restricted, but are 
nd ae ently more open to the vagaries of human judgment. 
Ravine too, constitute only limited samples of the general 
us ur tendencies or traits which we are trying to measure. 
esirable pointed out in Chap. I, numerous, varied samples are 
Ments e; and as each of these may require elaborate arrange- 
and materials, successful objective personality testing 18 


Chie; 
fly confined to experimental researches. 


PERSISTENCE TESTS 


N ki the first attempts was Fernald’s 

Woul ee simply in recording 
ime fo nd with his heels raised off the ground. 
group T normal students was 36 minutes, whereas among & 
Verne prisoners the median was only 15 minutes. Other 
as lon, S of this test depend on the subject’s holding his breath 
as possible, or sitting with one leg outstretched and the 


UF 
ernald, G. G., ‘ An Achievement Capacity Test’. J. Educ. Psychol., 


19 
12, 3, 331-336. 
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a ` Su a grip 
ised an inch above another chair, or sarah. ae 
en dynamometer equivalent to Lin Eysenck’s 
maximum grip. Several of these SR an = Be stability- 
En re De also show A a a 
A than hysterics. Howells 1 amiey ae which 
atten of tests of resistance to bodily pain an dende grades: 
showed some value in the prediction of pea e 
Obviously such tests have to be ‘ put across re might be 
to stimulate the subjects’ efforts, and their pA thought they 
seriously affected if, for instance, the subjects 
were being applied for selection purposes. 


«acts 
5 ich subjec 
Several other tests have been developed in which 
are given some diffi 


A ctua 
cult manipulative puzzle oine ded.? 
problem, and the time they are willing to spend on it en used, 
Wordbuilding or hard intelligence test items ne where the 
and a promising Number Series test for adults, 
subjects are told 


: le, 35 
that some of the problems are inso ER 
described by French.3 Here, too, the motivation of t RE ewha 
is vital. MacArthur 4 showed that results differ situations 
when the ndividually, or in Freee 
where a s pare himself with his $ ing was 
found it i ggesting that persistent Ternative 
© supply some similar “eher ha 
jects to go on to when 5 


P of the 
Ject can see only half an ineb aks) 
ylus. By inserting or Temora culty 
Set four problems of increasing 


oard around his st 
the experimenter can 


; » J. Abn. 80% 
1 Howells, T, H., “ An Experimental Study of Persistence,” J. 
Psychol., 1933, 28, 14-29. 

3 Cf. Hartshorne and May, and R 


? French, J, W., ‘ The Validity 
1948, 18, 271-277, 


g MacArthur, R. S., An Experime: 


Casurement at the Secondary Scho: 
London, 1851. 
6 


Morgan, J. J. B, 
J. Appl. Psychol., 1 


ans, Bibliography. hométt-» 
o£ a "Persistence Test’. Psyc 


its 
$ and? 
ntal Investigation of Piet SSE S of 
ol Level. Ph.D. Thesis, Univ: 


D 
Py tence = 

‚and Hull, H. L., ‘ The Measurement of Persis 

926, 10, 180-187, 
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the DoR 
ir a betog eee \ Instead of scoring objectively by time, 
ed pplies a 9-point rating scale which describes various 
to quit ee and hard work or of fiddling and wanting 
ian Heats re writer obtained good validity for this test among 
soluble substituted a very difficult fourth problem for the 
een in order to avoid tricking them. 
cis Heats conse that the correlations between such tests 
ches. o justify the conception of a common factor of 
Keeping-on. and claims that withstanding-discomfort tests and 
investigati at-a-task tests, etc., yield separate factors. But 
8 Suppe an such as those of Ryans, MacArthur ? and others 
together ae general factor in all types of persistence tests, 
Os 1 5 small group factors in particular types (cf. p. 14). 
eachers? ‚a attery of persistence tests correlates not only with 
but also or acquaintances’ ratings (at least among children), 
intelligenca)” scholastic success (over and above any effects of 
neurotieis eh and—inversely—with emotional instability or 


L STABILITY 


o elicit instability or 
king ’ situations than 
The McDougall- 


EMOTIONA 


S 

Variation. tests have been devised t 
ose aie of performance in more t provo 
c euere of o (oscillation) tests. 
Small circ] otting machine exposes to th 

han irregularly spaced, in each o 
Sooner peneil or stylus. The circles move faster and faster, and 
SYstemati later the subject breaks down. But he may continue 
ie lg dotting every second or thir 
Fa, ead and make feeble jabs, or give UP- 
VE the test to telegraphists, & 0 


Were n dots up to the point where five consecu ] 
ae These scores correlated from :33 to -46 with 
ieee: of neuroticism based on clinical interviews. 

les with obsessional tendencies, however; tended to score 
ysis of Tests Designed to Measure 


ITh 
Pere: Mton, G. R., ‘A Factor Anal 


Tsisten, 
Op nee” Psychol. Monogr, 1939, 51, No. 229. 
A 
(ramp Du ar, Culpin, M., and Farmer; E., ‘A Study of Telegraphists’ 
fice, pe Fat. Res. Board Rep» No. 43. London : H.M. Stat. 
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highly. The validity of this particular score requires con- 
firmation ; it may be that, as in Morgan and Hull’s Maze test, 
observations of manner of reacting to difficulties would be 
more useful. The fact that such motor co-ordination tests as 
rail-walking and manual dexterity are done badly by neurotics 
(ef. p. 79) gives some support. 

Cattell’s 1 C.M.S. (Cursive Miniature Situation) test is a much 
more elaborate version of dotting, in which the subject crosses 
out or encircles various kinds of lines and patterns which pass 
before him at a rapid rate. If he keeps his head and chooses 
the most profitable sets of lines, he can raise his score consider- 
ably. Possibly the task is a bit too sophisticated except for 


by electric counters. Kehr in Germany, and Freeman ? in 
d diseriminatory reaction tests which 


€wise put the subject in ‘ stress ° situations, and which appear 
to have given good results, Freeman compares the perform- 
i » with those under easy conditions, an! 
measures the time taken to recover efficiency after the stress 
Period. : 
Instability of performance at learning tests is also promising- 
Thus Ball finds that the learning curves of unstable or neurotic 
boys ata high-relief finger maze are much more irregular than 
those of normals. Rey 4 describes Similar results with a learning 
problem based on a kind of formboard which can be solve 
only by trial and error, N either test as yet gives a quantitative 


, 
LIE [B3 Ani Objeittve ‘Test of Character-Temperament ’- 
J. Gen, Psychol., 1941, 25, 59-73. 

‘Kehr, T., * Versuchsanordnung zur experimentellen Untersuchung 
einer kontinuierlichen Aufmerksamkeitsleistung ”. Zsch. f. ang. P. sychons 
1916, 11, 465-479. Freeman, G. L., ‘Suggestions for a Standardize 
“ Stress ” Test ”. J. Gen. Psychol., 1945, 82, 3-11. l 

? Ball, R. J.,‘ An Objective Measure of Emotional Instability’. J. APP! 
Psychol., 1929, 13, 226-256, l 

„Rey, A., * D’un procédé pour évaluer Pédueabilité *, Arch. de Psycho is 
1934, 24, 297-337. Cf. also Zangwill, O. L., “Some Clinical Application 
of the Rey-Davis Performance Test’, J. Ment. Sci., 1946, 92, 19-34. 
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ee instability; and inspection of the curves needs to be 
RER by qualitative observations of the children’s 
ae fo difficulties. But a more practicable and more 
and Meist est might be developed along these lines. Thiesen 
an er 1 report an experiment on maze learning under 
CR conditions—namely, no solution possible and 
el y the tester. Alterations in blood pressure and 
inability N) response during stress appeared to relate to 
genes, tolerate frustration, and to school adjustment in 
another te ut only 10 children were tested. Mirror drawing is 
oh lise, samning task which has, at various times, been alleged 
cael reedom from perseveration, or emotional control, or 
onesie ‘ousness vs. timidity, or yet other traits. Actually it 

X Sare moderately with intelligence; and in a research by 
first end a score based on the number of errors during the 
“impul ar trials correlated better with a trait-composite of 

siveness ’ than with one of ‘ emotionality ’. 


SUGGESTIBILITY 


A . 
betwe early as 1916 Brown? showed that the correlations 
Were en a number of tests supposed to involve suggestibility 

very low and irregular, and subsequent research has 


Con 
ce ue ed the lack of generality of this trait. Eysenck considers 
includes at least three distinct traits which he calls 
ity. Primary 


pri 
suggest „Secondary, and prestige suggestibil p 
on p Bbaity is best measured by the body-sway test, described 
bon” 80. The Chevreuil pendulum test of ability to hold a 
Similar Y at the end of a string, despite suggestions, 1s 
and oth; also possibly the autokinetic phenomenon. These 
ea er tests correlate with susceptibility to hypnosis and, as 
nd nd Mentioned, with neuroticism. (Note that all neurotics 
Only h be more suggestible, in this sense, than normals, not 
dysth: Ysterics. Contrary to common psychiatrie opinion, 
a obtain higher average scores.) nee 
3 ary suggestibility includes the numerous visual, 
‘A Laboratory In 
„Adolescent Children 


* Thies Hele 

Measures of J. W., and Meister, R. K., ertiga ton el 

SYchol,, q Frustration Tolerance of Pre ne 3 

Bro 049, 75, 277-291. F ven 

Univ, Calif W., ‘Individual and Sex Differences In Suggestibility ’- 
if. Publ. Psychol., 1916, 2, No. 6. 
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cutaneous, olfactory, and other tests, such as Binet’s progressive 
lines and weights, where the subject is led to anticipate & 
series of increasing stimuli; his suggestibility is measured by 
the number of stimuli that he judges larger after the actual 
increase has stopped. Acceptance of suggestions in recalling & 
picture (‘ Aussage ’ test), or suggestions that certain objects 
can be seen in inkblots, also fall under this heading, though the 
amount of positive correlation among such varied tests is low. 
Secondary Suggestibility appears to have no correlation with 
emotional traits or syndromes, but (at least among children) 
it correlates negatively with intelligence to a marked degree. 

‘ Prestige ’ suggestibility has been measured by giving an 
attitude or opinion questionnaire, then repeating it while 
informing the subjects that most people, or certain important 
people, answer each question in such a way. It is scored by 
noting how often the subject changes his previous response IN 
the direction suggested. Ferguson! has found that scores 
obtained from several such tests are moderately consistent. 
But there is no evidence to show that suggestibility to prestige 
in everyday life (e.g. to political speakers, to doctors or ministers; 


to newspapers, advertisements, or fashions) can be pr edited by 
tests of this, or of any other type. 


LEVEL OF ASPIRATION 


This is an important concept in modern personality theory; 
which refers to the goals or standards at which a person aims- 
Some people are highly ambitious, optimistic, or confident, 

le c actual capacity may be no less, are or 
realistic and cautious, or else unduly pessimistic and afraid 2 
failure, It has been studied by setting some task which provides 
considerable Scope for improvement, and in which the subje? 
can readily gauge how well or badly he is doing ; for examp ® 
» Substitution tests, or such manual tes 
This task is done for, say, ten 1-minu 


1 Ferguson, L, W., ¢ An Analysi i tibility 19 
son, L. W., ysis of the Generality of Suggestil 
Group Opinion °. Char. & Person., 1944, 12, 237-248. 


°C. Hunt, Frank, Biblio ibed 
t, ; graphy. The procedure here describe o 
wee by Himmelweit, H. T., ‘A Comparative Study of the Tev aiT 
piration of Normal and of Neurotic Persons’, Brit. J. Psychol., 1 


37, 41-59. 


s that 
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riods. ; 
(A); he i a eaen period the subject guesses his own score 
len a his actual score (B), and guesses what his score 
vibld a nn period (0). The differences between A and B 
et Pi of what is called Judgment Discrepancy, and 
Goal Disere tween. C and the previous achievement B yield a 
Dada Ree score. Other measures such as Responsive- 
isha: en ity are derived from the number of times the 
(C) go up or down in accordance with achievement, 


ee unaltered. 

n 

ee the results from different tests are rather 

AAD oven inconsistent, though Lewin 1 claims that this 

and the Bee by standardizing the technique and scoring, 

fevered ar ‘ivation of the subjects. There is some evidence of 

physicall pranon or discrepancy scores among cripples and 
y handicapped children, among pupils or students who 


are faili . G 
ing in their work, and students affected by poor economic 
than females. 


cire 

ee Males usually score higher 

and Jud eit 3 found that Goal Discrepancy tends to be higher 

hysterics u Discrepancy lower in dysthymies than in 

favourabl but the results of aspiration tests have 
n any e in subsequent researches by this author an 
i case the discovery of plausible and interesting group 


differ, 
e > 3 
nces does not necessarily show that the method is of any 
and Gardner * obtained 


real i 
very Noa for individual diagnosis ; 

(though meaningful) correlations between individual 

r of relevant traits. Thus it 


Scor a 
Seems I d ratings on a numbe: 
simplified the writer that aspira 
extremel and trivial situations ; 
Which a y chaney and have little bearing © : 
he a Sages s self-esteem operates in his real-life behaviour. 
Gilliland 5 v criticism applies to the early attempts of Moore and 
on the sub test aggressiveness > One of their tests was based 
ject’s ability to carry out oral addition sums while 


tion tests are based on over- 
that reactions to them are 
n the manner in 


as a Method of Studying 


1 

C 

f. Hunt, Bibliography. 
48, 11, 


SCH R 
P ersonalit, tter, J. B., ‘Level of Aspiration 
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Le er, J. W., ‘The Relation of Certaii 
ults with the Moore- 
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looking the tester straight in the eye; another on the sper 
and the aggressive or neutral content of free associations O 
such words as ‘ enterprise, success, danger...’ The correlations 
between the various sub-tests were low and sometimes negative, 
in a research by the writer with college students. Although 
small positive validity coefficients were obtained with a trait- 
composite for Dominance’or Leadership, the tests were clearly 
too artificial, and too greatly affected by the personal relations 
between tester and subject, to be worth following up. 


OBJECTIVE TESTS OF INTERESTS 


Interests are generally assessed by some form of question- 
naire (cf. Chap. IX), but Fryer’s book on interests describes & 
number of attempts to test them through objective measures 


of behaviour. Other possible approaches are listed by Cattell 
and Heist?; for example: 


Fraction of income and/or leisure time spent on each of a 
number of types of interest ; s 
Free association tests with words representing various 
types, scored by speed of response and/or psycho- 
galvanic reflex ; 2 

A large sheet of miscellaneous pictures and photographs is 
presented ; the subject studies these and is scored by 
the number he recalls belonging to different types ; 

Reading a passage concerned with an interest which 
contains nonsense words ; scored by failure to notice 
and cross out such words ; 

Subject studies pages containing statements relevant to 
types of interests, and irrelevant words ; strength of 
interest expected to be-shown by failure to recal 
irrelevant material, but the reverse was found. 


Cattell and his colleagues applied a dozen such tests in an 
attempt to measure twelve types of interests and attitudes 
among small groups of students. The inter-correlations wer 
so low that, though it might be feasible to build up a battery of 
the best objective tests for giving a composite measure of some 

* Cattell, R. B., et. al, ‘The Objective Measurement of Attitudes’: 


Brit. J. Psychol., 1949, 40, 81-90. ‘The Objective Measurement © 
Dynamic Traits’. Educ. Psychol. Measmt., 1950, 10, 224-248. 
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single type of interest, it would obviousl i 
. . be i 
23 en nie in assessing a Pa ad 
a el a la eh 
s objective test, however, which does off 

st > es O; u 
2 en is the test of information or dee A 
a eld. No doubt such tests depend on aptitude and 
Beker a well as interest, but they do tend to correlate with 
ne en and they do help in the prediction of 
Bt teas or e ucational success. Fryer describes a number of 
ane » | of mechanical information, which were 
nn in the American Army in 1918. Cattell! provides a 
fields : an short information tests dealing with the following 
Things pee Sport, Commercial, Mechanical, Scientific, 
Artistic A Mind, Rural-Naturalistic, Religious, Literary, 
= ee decorate) Sensual, Sex, Social, and Home. These 
Ben ciently thorough, nor well enough standardized, to 
oa mended for immediate use, but they provide a useful 
Ehe point. Tests of general information about everyday 

ical and electrical matters were of considerable value in 


allocati ER: 
ocating British recruits during the war,? and are being 


ad 7 
apted by educational psychologists in some areas for selecting 


bo 

en technical education. ‘The American Air Force psychol- 

ther’ + so designed information tests covering technical and 

successful fr of interests. Items which best differentiated 

out, and rom unsuccessful pilots and navigators were picked 

tributi used in a revised test, which made a valuable con- 
ion to aircrew selection.” 


P 
eel and Lambert 4 have published an ingenious combination 
n? test for measuring 


of i E 

eo mation and ‘ miniature situatio. 

different. = technical interests, to be used in selecting for 

questions ypes of secondary education. Several blocks of 

answer h if given, in each of which the pupil is told to 

technical, only. ‘Thus he may choo 
al, or mostly literary questions, and his 


1 Cati 
Londo; tell, R. B., A Guide to Mental Testing. London : University of 
a Cp nn 7e53,11080. 
CH Memon and Parry. F 
Army Arator; J. P., and Lacey, 
Ko notes Aviat. Psychol. Prog: 
“Peel, E. Government Printing Office, 
Educ. Psych A., ‘ Assessment of Interest 
sychol., 1948, 18, 41-47. 


se to answer mostly 
score is based 


J. I., Printed Classification Tests. 
Res. Rep. No. 5. Washington, 


1947. 
in Practical Topics’. Brit. J. 
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a i tness. 
on the proportion of choices as well as on their correc 
For example: 


Write the meaning, and if necessary make a rough N 
of three of the following words: monastery, 
actor, clawhammer, optician, hacksaw. 


ity is the 
Another cognitive test of relevance to personality ne tat 
George Washington Test of Social Intelligence. This gro the 
was devised by Moss, Hunt, and Omwake to N ae 
ability to understand and get along with people; it can Toll 
with high school or college students. The five sub-tests in 
Selecting the best res 
situations ; ts; 
Recognizing the emotions underlying verbal statements ; 
True-False statements about human behaviour ; ople ; 
emorizing and recalling names and photographs of pe 
Choosing the best completion to humorous stories. ; 
5 . a 
Primarily the battery measures the same general mar o 
ordinary verbal tests, but it has also given useful correla vaio’ 
around -4 with assessments or other tests of social extrave 


. : ; lem 
ponses in various social prob 


MORAL KNOWLEDGE OR JUDGMENT, AND 


CHARACTER, TESTS 


_ Large numbers of paper- 


in America,? including vocab eth a 
gonversel j ng terms; tests of biblical aa 
Teligious knowledge ; ranking offences in order of sea m ai 

ion tests (presented either verbally or Die SE 
ally), where children are asked the proper thing to do in V: 


moral situations, E.g. : 
If someone steals your lunch you should : 
Steal another lunch to even it up; 
Report it to the teacher ; 
Cry about it ; 
Say nothing about it. 


; Wash- 
* Moss, F. A., et. al., George Washington Social Intelligence ce ogical 
ington, D.C.: George Washington University, Center for Psy‘ 

Service, 1930. 


* Symonds (cf. Bibliography) provides an excellent description. 
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Such ; ee 
the en good reliabilities and inter-correlations, but 
east ey are heavily weighted with intelligence 
ehäracter B ey are unlikely to be highly predictive of ‘ good 
often ee 3 urt and others have pointed out that delinquents 
right si s well as non-delinquents what society regards as 
investigatio Tong. Nevertheless in Hartshorne and May’s 
eotrelated ee described below, a battery of tests, which 
Ohobjectivet with intelligence, did correlate ‘35 with a battery 
and judgm rs of honesty. In other words, the information 
ratings of nr tests were rather more valid than teachers’ 

Nest e same children’s honesty. 
eee cribed by Symonds, several miniature situation tests of 

Sit ee cheating were developed by Voelker, Cady, 
(or colle imer, and others in the 1920s. For example, children 
test ; Se may be given @ straightforward attainment 

: nown to them a copy of their answers is taken, and 


this is 
used to check the marks when they score their own 
d scores are compared with 


arked by the tester. 
es or carry out other 
these tasks are impossible 


e gives a measure of 
i list 


of book ti D 
EA a several of which are plausible but fictitious. The 
ishonesty. > claim to have read is again suppose 


Such 
ut a may strike the reader a 
research in the field of character, using these and 


man 
y other extremely ingenious tests, was of outstanding 
’s Character Education 


tificial classroom 
ns, which were 
for character. For 


s thoroughly obnoxious ; 


situati 
So a s, but also a variety of rea 
ged as to yield quantitative scores 


exam 

Jae games were arranged at parties which gave scope tor 

g. Children were sent on standard errands and given 
ed could be 


exces: 
atone en so that the amount they appropriat 
givin, Opportunities were provide for doing work for, 
Severa] ng away things to, other children, or for being selfish. 
ethica] of the persistence tests described above; together wit 
cf. p, ement tests, and ratings of children by one another 
3) or by teachers, were also employed. 
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When the results of different honesty measures, or persist 
or altruism tests, were inter-correlated, the agreeme De 
often so low that the authors concluded that there is no suaa 
thing as good character in general, rather that epee of 
acquire specific good habits in specific situations. Ye vaally 
the correlations were in fact positive, so that it is Cara 
legitimate to think of honesty, persistence, etc., as g ie 
underlying factors, which can be measured by combin nr; 
results of a number of different tests, that is by irait com DO 
Moreover, there was positive overlapping between en 
different composites constructed for measuring ho 2 
persistence, service and self-control, and A consistency Hy 
behaviour. Thus a kind of super-composite Ve 
character in general would be justifiable (cf. p. 14), alt that 
this manifests itself so differently in different situations 
no single test could be taken as a valid index. 


UES 
OBSERVATIONAL AND TIME-SAMPLING TECHNIQ 


Olson, Goodenou 
shown that it is 


ur, organized group play, Sr, aoe 

* Olson, W. C., and Cunningham, E. M., ‘ Time-Sampling Tec! iques fo" 
il ? -58. Thomas, D. S., Some New es f. 

Studying Social Behavior. New York: Teachers College, Colu! 

Publ., 1929, 


Cf. also Arrington Bibliography. ip, and 
*Parten, M., ‘An Analysis of Social Participation, Leadershi pogr : 

Other Factors in Pre-school Play Groups’. Instit. Child Welfar 

Ser. Minneapolis ; Unive: 


sity of Minnesota Press, 1931. 
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then observed each child for 1 minute a day for 60 days 
(distributing each child’s minutes throughout an hour’s school 
Period), and ticked off the category of behaviour into which he 
fell during that minute. A score for social participation was 
thus derived from the numbers of minutes in which he had been 
engaged in each category. It was found in this and other such 
Studies that time-sample scores possess good reliability ; for 
example, participation scores on odd-numbered days correlated 
highly with scores on even-numbered. Moreover, they predict 
future behaviour of the same type, OF else other types which 
One would naturally expect to be related (e.g. leadership, 
talkativeness, laughter, and physical activity) in a consistent 
fashion, Time-sampling is reliable or consistent too, in the 
sense that two observers making records of the behaviour of 
ne same children agree very closely provided that they are 
horoughly trained, and that the behaviour is defined sufficiently 
unequivocally. Thomas shows that such consistency 1S higher 
hen highly specific and objective activities are recorded (e.g. 
sal physical contacts with other children) rather than acy 
ich involve some interpretation (e-8- number of a 
Contacts), But the latter are certainly more meaningful an 


aa for personality study. R 
childs technique is mainly applicabl 
TEE both because their behaviour 1s 
older children or adults, and easier 
self because they can readily be obser’ 
“conscious ; (if a one-way observation SC 1). It 
nit need not know that they are being watched at all). 

een extended to older children in a res icki 
© recorded the ran habits, nail-biting, no Ea 

3 pad-seratching, tics, ete., of children in class, unknown 
a em. Though it was difficult to get high observer-consistrne 
arcliable total score was built up, which bore some z Se a 
suchers’ ratings of behaviour difficulty. Newoon)joverted 

milar technique to a study of extraverted and intro 


e among very young 
less complex than that 


a pe 1 
ching? W. C., ‘The Measurement of Nervous Habi ete 
of Minnest Instit. Child Welfare Monogr. No- 3. Minneapolis : 

sota Pr. i f t 
Beh ewcomb, a Consistency of Certain Extrovert en 
Dior. Patterns in 51 Problem Boys’ Teachers College 
r. Educ., No. 382, 1929. 
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; with 
behaviour at a boys’ summer camp. It might well be ele 
adults in a factory situation or committee 3 indes Oben 
the time and motion and accident studies, with ass tom 
tion, and with recent work on social dynamics re, 
these are not, of course, concerned with personali m ae a 
Such research is much more troublesome than wou m 
first sight because of the difficulties ‚of defining | acon 
behaviour sufficiently rigidly, of securing impartia E 
sistent observers, and ensuring that their presence PEE t 
affect the behaviour of the individuals they are z e in, for 
Nevertheless it has yielded results of the greatest va ne enka 
example, Murphy’s study of sympathy | and aggre: of the 
among young children, D. E. M. Gardner’s comparis ee 
personalities of 5- to 10-year-old children taught in Be effects 
and orthodox schools, and Lewin’s investigations of the "ae 
of frustration and of authoritarian, laissez faire and demo 
club leadership on boys’ social behaviour.1 


GROUP OBSERVATION TECHNIQUES 


Burt describes a research 
War, where 53 19- 
number of traits ( 
ship, Delinquent 
observers em 
were validate 

e average 
“44, and for t 


e Jd 
carried out during the Firet yor a 
to 14-year-old children were asese ender- 
Emotional Stability, Extraversion, endent 
Tendencies, ete.) by several nee ments 
ploying different techniques. These ju Stings: 
d against exceptionally thorough teachers 


jews was 
coefficient for judgments 


based on interge othe 
hose based on projection, questionnaire, & 


coeffi- 
ly -27. Considerably more successful the eciall; 
ging :54—were observations of children in sp 


a vis! 
arranged, but natural, situations such as a tea-party and 
to the Zoo. ‘Q 


$ jses 
; tle er! 

n these occasions a number of stock lit 
were stage-man, 


: jca 
aged, so that each child’s reactions v Tails 

everyday emergencies could be observed.” No furthe lie‘ in 

are given, but similar observational methods were app 


; york? 
vior and Child Personality. t New ails in 
7. Gardner, D. E. M., Testing us S 


itt, Be 


ds. London tally 


F imen 
and White, R. K., * Patterns of Aggressive Behavior in Experi 
Created “ Social Climates ” >, 
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assessing students engaged on biological field-work.! Clearly 
these investigations were forerunners of important later 
developments. 

In the 1930s, German military psychologists devised elaborate 
methods of officer selection based less on objective tests than on 
qualitative observations of expressive movements and of 
behaviour in situations involving stress. In some of the tasks 
candidates had to drill recruits, give instruction or short 
lectures, or carry out complex orders requiring quickness of 
uptake, physical agility, or endurance, or improvization in 
emergencies. Their reactions were observed and interpreted by 
trained psychologists. No good evidence of the validity of such 
Methods was ever collected, and it is only too likely that the 
tasks were too artificial or the judgments too subjective to be of 
much value. Nevertheless, during the war, British psychologists 
in War Office Selection Boards, and American psychologists in 
the Office of Strategic Services,” developed similar techniques, 
which combined objective tests of abilities, questionnaires and 
Projection tests of personality, with interviews by military 
Officers and psychiatrists, and with observations of behaviour 
at certain group exercises. The W.O.S.B. technique has 
requently been described. Groups of about eight candidates 
Or commissions were studied over a 2- to 8-day period by a 
Senior army officer, a military testing officer, a psychiatrist, and 
a Psychologist, who at the end pooled their information before 

€ciding on the suitability of each candidate. The exercises, 
Which were watched by the testing officer and sometimes by the 
Other staff, were designed, not to bring out particular traits 
leadership, co-operativeness, initiative, ete.), but to be 
analogous to some of the common social situations of army life, 
and to afford opportunities for observing how each candidate 

haved in a small group. Ina typical ‘ leaderless-group ‚test, 
Or instance, the group is assigned some task such as Oe 

€avy object over a set of obstacles, and is left to work out 1 


own. soluti i merely as passengers, 
olution. Some candidates behave yi DENE 


Others t 5 ] e sei 
ry to dominate the rest, while som i 
Come to the fore, though acting in the group’s rather than their 
Per Burt, C. L., “The Factorial Analysis of Emotional Traits’. Char. & 

„son., 1939, 7, 238-254, 285-299. 


?Cf. Vern 
= on and Parry. Also, 
Assessment of Men. New 3 York : Rinehart, 


Office of Strategie Services Staff. 


1948. 
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own interests. Note that this is not an objective test of any 
quality. It falls rather under the indirect or expressive methods 
of Chap. IV, since the candidates would usually assume that 
their ingenuity rather than their social response was being 
tested. And, along with the other exercises and interviews, 
it yields, not scores or measurements, but subjective ratings 
by the observer of the candidate’s personality as a whole. 
Hence it depends enormously on the skill and experience 
of the observers. Other weaknesses have been pointed out 
by the writer elsewhere, for example, the obvious depend- 
ence of the candidate’s behaviour on his interpretation of 
the procedure and his preconceived notions of the sort of 
personality that he should try to display. Thus the agree 
ment between different observers, or between ratings given 
on two or more occasions, is nothing like so high as that of 
time-sampling ; it is usually around -6 to -7. Nevertheless 
this is somewhat higher than the reliability of judgments based 
on interview alone, or of ratings based on general acquaint- 
ance and on casual, as contrasted with directed, observation- 
Various follow-up studies of candidates selected by these 
procedures have given quite low correlations with the officers 
subsequent success at a training unit or'in the field. There ar& 
however, extreme difficulties in securing a reliable criterion ° 

Success ’, and validation is necessarily confined to the highly- 
selected candidates who have actually been chosen ; i.e. there 
© no way of proving how unsuccessful the rejected candidates 
would have been. One can at least state that the method 15 
Superior to the older methods based on interview alone. More- 
over, it has the tremendous advantage of appearing to Ly, 
fair to the army, thus stimulating the supply of candidates 
and improving the confidence of senior officers in the 
subordinates. 


It is because of this high ‘ face validity ’, and the acceptability 
to candidates and to users, not because of their proven value, 
that group observational procedures have had such a vogue 
Since the war. They are still used, in somewhat abbreviate 
form, by all three Services for officer selection. The ma 
nt adaptation was in selecting high-grade civil servari “ 
or the administrative class and foreign service (where ve 
ordinary interview plus academic examination method 
selection was obviously inappropriate for assessing men an 
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women whose careers had been interrupted by war service). 
Here the analogous exercises were designed chiefly to bring out 
desirable qualities of intellect, for example : free discussions 
on a given topic among a group of candidates, giving a short 
lecture, studying a brief and expounding it to a committee, 
ae acting as chairman to a committee. Judgments based on 

ese situations were supplemented by the study of objective 
test and examination results, background data and references, 
end by interviews. It was possible to demonstrate quite high 
validity for the procedure as a whole when the selected candi- 
ates were followed up after 2 years in the Civil Service (cf. 
P. 29). Correlations of around :6 were almost as high as the 
Correlation between the Civil Service’s own judgments of 
Suitability after 1 and after 2 years. In other words, the 
systematic 2- to 3-day procedure was nearly as accurate in 
°recasting success as was casual observation of the whole of 
eed first year’s work. But it should be stressed that up 
a ereises, although quite time-consuming, supplied only abou 
auarter or a third of the material on which the final choice 
RSA, made. Moreover, the staff were exceptionally nen 
that Y Stable in composition. It certainly does no a d 

at any ad hoc adaptations of the “ house-party ° methor, 


either 5 . i ally successful. 
Y psychologists or laymen, will be equ y 3 

Both the Arny An the Civil Service have departed tory poe 

Vv ‘Binal procedures, on grounds of economy, and satistac oe 
vidation of their present watered-down methods is not ye’ 
available P 

a ons to the 

Sele i 

co 


na 

3 grou i He reports 
wi P of candidates. pi ed 
wh the products of the method, but this 1s n 


r : E 
Scientific validation. Candidates ir bren similarly 


teacher train: 
dealt with ne and for youth leaders, 


a = ion Board’. 
On: Wilson, N. A. B., ‘ The Work of the Civil See ne Validation of 
Civil a Sychol., 1948, 22, 204-212. Vernon, °° Psychol., 1950, 2% 
75_g5 Tice Selection Board Procedures’. Occup. Psycho Occup: 

é > Occup: 
Payepate®s J. M., ‘ New-Type Selection Boards in Industry 
ol., 1947, 21, 170-178. 
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In one or two education areas, groups of 11-year-old he 
who are borderline candidates for grammar school places A 
been collected at a convenient centre for a day, and observ adn 
a variety of tasks not unlike the Army ea group ae 
group games, constructional, imaginative, and i Ehen 
activities. The teachers and psychologists who oa whith 
apparently reach an agreed judgment quite readily as Ks: 
children show most initiative, co-operation, and other = 0 
personality qualities ; but again there is no evidence ye ost 
how far this predicts anything relevant to ae Bere 
Success. Here, too, there is some danger of children Be; 
unnaturally, through a sense of the importance of hr oc “nl 
or as a result of coaching by schools or parents. One al 
have thought that a week or a month’s trial period in an een 
grammar school, with a teacher specially trained oe aaa 
their social and intellectual adjustments, would PE 
diagnostic; or indeed that a system of intermediate s' 


even 
before a final decision is reached at 13 years would be 
more effective, 


In conclusion : 
personality tests, 
the conventional i 
they provide a m 
which to observe 
as the interview 
the observer, an 
caution because 
measure of coni 
One might expe 
of behaviour at 
IV, because the 
their fellows, in: 
and as yet unsubstantiated 
the situation i 
stimulate self-co 


VII 
Ratings and Judgments of Personality 


Te obje ; 
that 3 Sr the rating method is to draw on the knowledge 
turn this into n’s associates have acquired about him, and to 
personality el estimates of his standing on "various 
nowledge iv s. Let us look first at the acquisition of such 
clusions eg soon as we meet a person we jump to con- 
Movements ce We interpret his features and expressive 
arrive at Sig any actions we see or words we hear, and 
Whole, One ae of picture or schema of his personality as a 
elp to fill erg contacts, observations, and conversations, 
ut When we and extend, sometimes to modify, this schema. 
mark roe SSe bi asked to rate him and give him, say, & high 
he much becau ility or a low mark for Dependability, it is not 
„ehaviour hen we have observed any particular pieces of 
N E generalize fi are representative of these traits, as because 
m. "Vations ae our total impressions. Sometimes certain 
ments : he ma and out in our minds and influence our judg- 
© call him un rf have failed to carry out some commission, SO 
rs more De u But usually a whole conglomeration 
A ound up in analysed recollections and emotional reactions 
be Siderably aed judgment. Earlier conclusions about him 
en formed ect later observations; once the schema has 
we tend to interpret what we see of him to fit in 


With ; 
it, 
Thus the schema is not an objective portrait or 
embody visual 


Sum 
‚Ummar 
i y 
of the person. Although it may I 
es an emotional 


Mages a 
attitude re verbal descriptions, it also involv 
asong ate towards him. Landis ! has studied the 
ten, good Be raters for their judgments, and pointed out 
TE o be ad reasons have little effect on accuracy: 
Ose rea] pee react in the psychoanalytic 5° 
atta © Own aan the rater is largely unaware . 
Schiene are of human nature and the meanings We 
various traits also affect our judgments. In the 
J. Personnel Res., 


ı 
Land; 
log, ndis, C 
> er ‘The Justification of Judgments 5 


nse, of 
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course of our lifetime, our analysis of self, our contacts with 
other people, and the books and newspapers we read or the 
cinema films and plays that we see, all help to build up in us 
a set of stereotypes or stock personalities—the typical athlete, 
the aesthete, the absent-minded professor, the pedantic civil 
servant, etc. We are very apt to fit each new acquaintance into 
one or other of these categories. Actions which fail to conform 
are often not noticed. Hence our schemas remain primitive 
and far too simple to cover the complexities of the personalities 
we actually meet. (An enlightening discussion of the develop- 
ment of conceptions of people among children and poorly 
educated adults is given by Watts.1 The interplay between 
our linguistic education and our understanding of people 
deserves much more study.) 

The result, as mentioned in Chap. I, is the halo phenomenon. 
Hither our general liking or disliking affects our judgments of 
what should be distinct traits; or our schema embodies sO 
strong an impression of one personality type or trait—say 
jovality, devotion to work, selfishness, or their opposites—that 
we interpret all other behaviour and rate other traits to accord 
with this. The subjectivity of judgments of personality 15 
apparent whether these judgments are expressed in a free 
description (witness the varied interpretations of Napoleon’s 
or Hitler’s personalities by different authors), or as ratings. 
Although the latter are given in more standardized form, and 
the rater is usually warned to avoid halo, yet discrepancies 
between raters are probably as great as, or greater than, 
between biographers because of ambiguities in the interpretation 
of traits to be rated, and variations in standards of judgment. 
Ratings are therefore best regarded as samples of the ‘ reputa- 
tion’ of the subject in the eyes of the rater. They are most 
inadequate as sole criteria of a person’s traits, or as the sole 
source of data for the scientific study of personality. Yet at 
the same time they cover a much wider range of more natural 
behaviour than any practicable battery of personality tests oF 
time-samples, and they have the tremendous advantage © 
being applicable without taking up the time of the subjects— 
even without their knowing anything about it, In an extensive 
research by the writer on a small group of students, the averag® 


1 Watts, A. F., The Lang: 


uage and Mental Development of Children: 
London : Harrap, 1944. 
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validity coefficient of sets of associates’ ratings, when compared 
with trait-composites, was +:60, whereas most of the better 
objective tests yielded coefficients between :30 and -45, Un- 
doubtedly, then, ratings are useful, and they should be included 
in any approach to the assessment of people, provided that certain 
precautions are observed. Indeed they have probably been more 
widely used (in the form of school reports and record cards, merit 
ratings in industry, etc.) and more thoroughly studied than any 
other psychometric technique except the intelligence test. 

It should be noted that ratings overlap with many of the 
more objective methods described in previous chapters, and 
particularly with the expressive methods of Chap. IV. There 


we saw that measurements of the speed, extent, pressure, etc., of 
as judgments 


expressive behaviour were seldom as diagnostic 
of the behaviour by an impartial observer. (Nevertheless 
there is much to justify the argument + that a properly weighted 
combination of measurements from such tests as Porteus 
Mazes, the Luria apparatus, or some form of dotting or stress 
test, would be more accurate than subjective interpretation.) 
Time-sampling and group observational methods are kinds of 
rating, and the former—when applicable—is superior to the 
latter just because it leaves SO little to the judgment of the 


observer. Some researches based on ratings such as Webb’s, 
ired the judges to observe 


onsiderable period ; and 


Several rating devices me 
rate more from direct observation tha 
recollections. These steps shoul 
fuller and more impartial, thoug' 
inate all halo, stereotypy: and bias. 


RATING TECHNIQUES 


1. Ranking and Paired Comparisons. A school teacher may 
arrange her class, or an officer his platoon, in order of merit for 
1 Upheld, for example, by Eysenck, H. J.» The Scientific Study of 


Personality. London : Routledge, 1952. 

2 Webb, E., ‘ Character and Intelligence ’. Brit. J. Psychol. Monogr. 
Suppl., 1915, No. 3. Burt, C. L., © The Factorial Study of Temperamental 
Traits’. Brit. J. Psychol. Statist. Sec., 1948, 1, 178-208. _Newcomb, T. M., 
‘ The Consistency of Certain Extrovert-Introvert Behavior Patterns in 51 
Problem Boys’. Teachers College Contr. Educ., 1929, No. 382. 
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ait. This is hardly applicable when the number of cases 

den say, twenty because of the difficulty of Sisinguiel ge 
among the bulk of middling people. But it has the advan 
of avoiding the vagaries of absolute standards (cf. below). ae 
is best to put each name on a separate eard and let 
rater sort them out. If a printed list is given, judges ar 
apt to rank people at the top too high, those at the bottom 
too low. Seii ` 

In the paired comparison method the rater is given BEN 
possible pair of names and asked to say which of the two 1 
higher. The results can be expressed finally as a ranking or & 
scaled score! For most purposes this is an annee ss 
refinement., It is usual to convert rankings into normally 
distributed scores, for example by means of Hull’s or Symond’s 
tables. If this is done it is quite simple to combine or average 
rankings provided by different raters, each of whom may have 
judged a somewhat different list.® (For example, the Maths 
matics, French, and English teachers may each have ranke 
some, but not all, of a group of children.) a 

2. Numerical Ratings. It is an old parlour’ game to give 
people marks for traits. Dr. Johnson is said to have been 
annoyed at getting 0 out of 20 for Good Temper and Manners 
from Mrs. Thrale, though he also received 20/20 for Morality. 
Obviously it is impossible to distinguish as many as twenty 
grades, or to give any consistent meaning to percentage mork 
and the like. Experiments by Symonds and others show h 
five, or at most seven, grades is the largest number that t T 
ith. These may be denoted 5, 4, 3, 2, 3 
—1, —2, or turned into letters, A to E, © 


Strongly present, Present, Average, Lacking, Strongly 
Lacking, and so on. A smaller number of grades than five (ee 
Yes, Doubtful, No, or +, 0, —) is rather wasteful of the rater 2 
powers of diserimination, but is nevertheless often used whe? 
yo ct Guilford, J. P., Psychometric Methods, New York : McGraw-Hill, 

* Hull, C. L., Aptitude Testing. Yonkers, N.Y.: World Book Co., 1928 
Symonds, see Bibliography. 


* Note that it is essential that each person should be rated or ranked py 
the same number of judges. If 4 judges rate some, and only 2 judg 
others, the spread or scatter o 


il 
f ratings among the former will necessarl Z 
be smaller than among the latter. 
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there are man i 
y questions to be answered or traits 
le D l aits to be rated, 
ER hr rater has little detailed knowledge of the subjects 
T . . 
a Is er defect of this type of scale is the variations 
a e s and distributions adopted by different raters. 
See von same difficulty arises in the marking of English 
Ee of essay-type examinations, which is, of course, a 
Be rating.) The psychologist would naturally prefer 
ar gs of any large group of subjects to conform fairly 
y to a normal distribution, say : 


A. Bo CRRIDMIRE 
7 24 88 4 7% 


indeed they seem to 
and most avoid 
ften reduces to 


Mo: 

en tend to be too generous ; 

using th or Average as a term of abuse ; 
e extremes, Thus the distribution o 


Something like this : 


A BF CADERE 
= 60 so 7 50% 


ine 
d. If one rater scarcely ever awards an A 
he trait than 


ven on rec 


sc 
no ya teachers from numerous different schools 
SETAN the secondary school teacher of the j 
Taters ad e latter cannot know what distributions the different 
indust opted. The same is tru t supervisors in 
gain ry, or different officers co s of soldiers. 
Tating, it is impossible to combin ts of several sets of 
vary (j of overlapping groups of people W. 
Dan DiS as different sets of examinatio 
ometin e combined unless scaled to & com 
Ic es, therefore, raters Or markers are 
f. Vernon, P. E., The Measurement of Abilities. London: University 


of 
London Press, 1940. 
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. to. 
an ideal distribution which they are advised to adhere to, 
has: 
suc’ wits 5 D E 
10 20 40 20 10 


F : sel 
This is a very simple pattern, which Spprozimetes CI 
enough to normality. But they seldom conform i d. Eaa 
they are trained to do so, and are frequently checke ther than 
be preferable, then, to force them to use relative dee 10% 
absolute ratings, that is, to get them to pick out the f B's and 
and worst 10% of their group of ratees, then 20% a r forms 
20% of D’s. (The same method can be applied to 3% ne: none 
of distribution.) They will naturally object that t ae g This 
are almost all higher, or lower, than the general gene fom 
may well be true when, for example, ratings are collec eal tee 
different primary schools that feed one grammar sc roups 
from different streams within any one school. If the A ia 
are quite small they are especially apt to vary in merit, AR 
it is unfair to reduce them all to the same mean and sta = ty 
deviation. It is sometimes possible to adjust the u on 
allow for such group differences, as when essay ami t the 
marks are scaled against objective attainment tests. pu that 
difficulty is such a serious one in the field of personality; only 
we must conclude that ratings should generally be use more 
within groups all of whose members are known to two or aring 
raters. There is no really satisfactory way of ee 
ratings by different judges of different groups, unless the g P 
are large enough and similar enough for relative rating see 
fair, Obviously this greatly restricts the practical use 

of ratings, 7 ed 
a Various devices have been intro s. 
s down to more consistent standard 


lowest, another half-way 
names were retained as a 
any new officer, X, the ra: 
most closely resembled. 


1Cf. McClelland, W., Selection 


London ! 
University of London Press, 1942, 


for Secondary Education. 


Ratings and Judgments of Personality 107 
a other traits. Since different raters still have different scales, 
t is doubtful whether this is of much value. Its application 
© the marking of handwriting specimens or children’s com- 
Positions or drawings by means of quality scales, where all 
markers use the same standard set of specimens, is a different 
Matter, 
ee Verbal and Graphic Scales. The substitution of such terms 
ne Excellent, Good, Average, Poor, Bad, for letters or numbers 
5 ca little help ; though a clever choice of terms will sometimes 
En to counteract the tendency to undue generosity OT the 
G ency to avoid extremes. For example : Excellent, Very 
ood, Good, Fair, Weak, may produce a better distribution. 
= an extension of this idea, proposed by Freyd,! has De 
as y widely adopted. In the graphic scale, each step is define 
„concretely as possible, so that the rater no longer has to 
uk Quantitatively or bother about standards. In addition 
en ambiguity of vague trait-names like leadership, industrious- 
IL ete., is avoided to a considerable extent ; verbs describing 
an Aviour are substituted for nouns and adjectives. Bem 
muse from the American Council on Education scale tor 
ing college students ?: 
eae X need constant prodding, or does he g 
without being told ? 


o ahead with his 


Needs — i i — 
Prodai much Needs occas- Does ordin- Completes Seeks eae 
oin, 8 in ional prod- ary assign- suggested for N r 
ary srdin- ding P ments of his supplement- additiona 
Ments pen own accord ary work 

T . 

line ed merely puts a tick or cross at whatever pomy on mi 
Mmea that he thinks appropriate, but the Gian Bie 
Sca) Sure off this position as accurately as he wis nr = 
a Ra Were used during the war for assessing the g P 
and An candidates, or the efficiency of soine Hee a 

el i tisfactor, : 
they l considered the most sa ita eg eps oases 


embody several questions with % 


l., 1923. 
82-108 yd, M., * The Graphic Rating Scale’. J. Educ- Psychol., ) 
2 C i 
Scale‘, Bradshaw, F. F., ‘The American Counci 


Arch. Psychol., 1930, 18, No. 119- 


il on Education Rating 
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(ocasionally two or four), these being couched as far as possible 
in Service language. For example : 


A.1. Hard conditions tended to get him down. 
2. He accepted bad conditions cheerfully enough. 


8. He helped to keep up the men’s spirits when conditions 
were bad. 


B.1. He has a flair for improvising (tools, materials, etc.) 
in an unexpected difficulty. t 
2. He is reasonably good at making the best use of wha 
is to hand when things go wrong. 
8. He is lost without the usual tools, materials, etc- 


The grading on any one question would naturally be too 
coarse, but with half a dozen to two dozen or so questioni 
each covering some different aspect of efficiency, a total scone 
can be derived, and these scores tend to show good distribu 
tions. Another advantage of such scales is that inexperience 
raters find them easy to understand and apply. Even thoug 


the number of questions looks formidable they can actually 


be answered quite quickly. A minor objection is that they 
use up a lot of paper. 


A Major ones are that some TEE, 
continue to be much more generous than others, and tha 
correlations between different raters of the same subjects are 
still quite low. 


Our last example illustrates the device z 
al trait into a number of more re. 
are separately r: e scores ¢ 
bined. Such component. shoud, sh eel a be relatively 
another, in order to cover the whole scop? 
ently as possible. However, halo is Bee 
t of the items within such a scale tend to 5 
e direction. Components regarded as mea 
f course, be given higher weight in the to" 


igh actually thi : ; that it 18 
seldom worth Whe y this makes so little difference 


aus One would ex: cales to be mer 
objective, and to be more he | by aiffer 
Taters, because they avoid indefinite and equivocal tr aitaan es, 
is not very favourable. In several A 
en total scores given by two raten for 
me :5 level which is normally foun 


so strong that mos 
rated in the sam 
important can, o 
score, thou, 


Ratings and Judgments of Personality 109 


eneral-trai ; ee 
ya ratings. (Similarly in the marking of essays 
region: emes. are often found to be no more reliable than 
Ann nee { 
(ie. panna detailed scales, or third-person questionnaires 
Bex published questionnaires to be filled in by a rater) have 
Sti ann . Marston’s scale of twenty items indicative of 
the soci] a, in children, and Bridges’s scales for 
nae i emotional maturity of pre-school children, are 
Emotional ute Laird’s C-3 test of introversion, Willoughby’s 
On initrover Maturity scale, and Heidbreder’s questionnaires 
DOG result sion and inferiority attitudes,? have given rather 
whose En peruap. because they are designed for adults, 
of children onal traits are less overtly expressed than those 
h it 
into u would suggest, then, 
all number of 3-point sub-scales may be worth while, 


ecause 
will Measles raters find this easy to answer, and because it 
y yield a good distribution of total scores ; but that 
ozen items or aspects of a 


€ inclie: 

Tait is an of more than half a d 
© be dis En y to be a waste of time. If several traits, presumed 
nct, are to be rated at the same time, each should be 


Cover © 
on a ea by three or four items, and the ¢ 


that breaking down a trait 


tems presume 
sely with Trait B items, an 
orial analysis, OT by simply 
h improved com- 


3 ane can thus be constructed. 
ndardized and ‘ Derived’ Scales. We generally assume, 
graphic, verbal, 


Witho 
1 M 
Stua Arston, L } i ante 
a ti f£ Young Children - Univ. I 
ae men Welfare, 1025, 3, "Bridges K. M. Be The Social and Emotional 
s Laird ùl of the Pre-School Child. London : Kegan Paul, 1981. 
publican ı A., Personal Inventory, ©? Hamilton, N.Y. : Hamilton 
Pi ze Psy <A Scale of Emotional Maturity `; 
: Ychol., 1932, 3, 3-36. Scale published by Stanford University 
! N Introversion and Extroversion + 
E., ‘ The Normal 


ess, 
J > 19; 
Inte St 
Nferiori oo Psychol., 1926, 21, 120-134. Heidbred aag aa 
1., 1927, 22, 299° 
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3 jor 
or numerical scale are equidistant, i.e. that A is as much SPERRE 
to B as B is to C, ete. One technique of achieving ai i 
units is to apply Thurstone’s method of attitude ae Rs ables 
long series of statements. As shown in Chap. IX, t ORE 
us to assign a rational numerical value to each of the sta Bi 
we select, and at the same time to eliminate ambiguo ae 
unsatisfactory statements. The following are three statem 


; ; o 
standardized on a 0 to 8 scale, for assessing the efficiency 
travelling salesmen 1: 


6:9) He is making exceptional progress. lks. 
a He is somewhat in a rut on some of his brand tal 


“re 
(5-6) He tends to keep comfortably ahead of his wo 
schedule. 


tee 
Any of the statements that are thought to apply to the el 
are checked, and their average scale value gives his 
Willoughby’s Emotional Maturity scale 2 is similar. Vi eland 
A different form of scaling is represented by the a 
Social Maturity scale.» This contains 117 items such as : 


Reaches for familiar persons (4 months.) 
Dries own hands, 


(23 years.) 
Is trusted with money. (53 years.) 
Makes telephone calls. (10% years.) 
Provides for the future, (25 years.) 


Q. The Subjective element is greatly reduced since ae 
1 Richardson, M, W., and Kuder, G., F, « Making a Rating Scale t 
Measures ’, Personnel J., 1933, iby, 36-40. 
2 Op cit. Training 
® Doll, E. A., Vineland Social Maturity Scale. Vineland, N.J. : liminasy 
School, Educational Test Bureau, 1936, Cf. Doll, E. A., ‘ Pre mere J. 
Standardization of the Vineland Social Maturity Scale’. 4 jicatio® 
Orthopsychiat., 1986, 6, 283-293. It appears to be suitable for app 
in Britain; ef. Kel 


jal 
; Soci® 
‘mer Pringle, M. L., ‘Social Maturity and 

Competence ’, Educ. Rev., 1951, 8, 113-128, 183-195. 


a Tp EEE 


Ratings and Judgments of Personality 111 


examiner, by skilful questioning, can discriminate fact from 
interpretation in the informant’s statements. It is claimed that 
different examiners, even interviewing different informants, 
arrive at Social Ages with as good a reliability as 90. The 
resulting S.Q.s tend to correlate rather highly with I.Q.s, but 
there are good grounds for thinking that they also represent 
Some aspect of social competence and personality maturity 
which is particularly relevant in certifying mentally defective 
subjects. A similar scale for assessing altruism among children 
is described by Turner. 

The Haggerty-Olson-Wickman Behavior Rating Schedules * 
represent another interesting development. A series of graphic 
scales for rating thirty-five common traits was applied by 
teachers to a group of children who had previously been very 
carefully assessed by psychologists for personality maladjust- 
ment. By analysing the numbers of well and poorly adjusted 


children who received each particular rating, a maladjustment 
index was derived for that rating. The Schedule could now be 
l indices calculated for 


applied to any fresh child, and the tota e 
all his ratings in order to show his maladjustment. This has 
the advantage that the teachers are not asked to assess person- 
ality disorders as such, but only some of the more acceptable 
social, emotional, and other traits. Thus biases and misinter- 
Pretations of traits should be greatly reduced. Nevertheless 
there is still considerable subjectivity, since the maladjustment 
scores obtained from ratings by two teachers usually correlate 
only to about -60. As Olson points out, the same set of ratings 
could be standardized against other criteria ; a series of scoring 
keys could be built up in the manner of the Strong Interest 
Blank (cf. p. 164). It would be of interest to try out this 
technique in grammar school selection, where it is known that 


ordinar ity ratings by primary school teachers are of 
ee E ee ked to check a series 


little value (ef. p. 25). If teachers were as: : 
of concrete statements (similar in form to the Army rating 
Seales, p. 108) about the work and behaviour of their pupils, 
also about health and home environment, it might well be 
Si 1 Turner, W. D., ‘ Altruism and its Measurement in Children’. J. Abn. 
oc. Psychol., 1948, 43, 502-516. 

2 Published Br World Book Co., Yonkers, N.Y., ‚1930. CE. Olson, 
W. C., Problem Tendencies in Children. Minneapolis : University of 
Minnesota Press, 1930. 


iu 
u a u ; 
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i iated 
found empirically that some of the statements a En: 
pupils who later had successful and unsuccessfu ae hE be 
school careers, Only a proportion of the statements raat A 
diagnostic, but a really valuable measure might be Sees nee 
from these. Such a research would be lengthy and a in 
to be done on very large ee, rated by many teachers, 

T to yield a reliable scorin ey. ee 
Econ similar is A. H. 5 . Baines’s 1 method of 
gradings of the efficiency of civil servants. A number = dozen 
visors of, say, clerical officers, rate these officers on gli 
aspects of their work, on 3-step scales, and also EE for the 
general (5-step) grading. Correlations are celeuatic Br 
sub-scales with one another and with the final gra 
these show which sub-scales most closely predich om ding 
efficiency. A total score is then based on the genera eat the 
plus the best sub-scales, suitably weighted. pore think 
supervisors are not asked which qualities or aspects > pie to 
relevant ; by their own use of the scale they show the 
which they attach most importance, y Jes 

The mages sophisticated development of derived rating aa in 
is the Forced Choice type, which is employed, for oa PE A 

‘S. Army officer report forms. Such scales contain S 
blocks of items, like the following : 


A go-getter who always does a good job. 
Cool under all circumstances, 

Doesn’t listen to Suggestions. 

Drives instead of leads. 


to be equally popular = 
n their diseriminatory power ; one cor a 
well, the other badly, with some criterion—say "eo two 
y ratings. Similarly t which; 
Thus a scoring key is available s tl 

Y contrasting the valid and non-valid items, reduce 


: : 4 0 88. 

ratings to bias and halo. It js too early Bvious 
whether the advantages of this technique outweigh the © 
1Cf. Anstey, E., ‘Staff 


, 
ment * 
Reporting in a Government Depart 
Occup. Psychol., 1950, 24, 200-229. 


Jates 
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disadvantage that most raters find it highly irritating. 
Probably it has greater promise in the field of self-rating 
questionnaires. 

7. Voting and Guess-Who Techniques. Ratings of pupils by 
one another on ordinary rating scales are of little or no value. 
Children are even less familiar than educated adults with the 
meaning of traits, less able to think of traits quantitatively, less 
able to observe objectively ; and their suggestibility or contra- 
Suggestibility to the teacher or psychologist who asks for the 
ratings may entirely distort the results. The idea of voting, 
however, goes down more readily, and by the age of 9 years or 
So they can pick out the two or three in their class who they 
think are highest or lowest in various respects. Hartshorne 
and May drew up a series of short character sketches, e.g. of 
a very selfish, moderately selfish, an average, and an unselfish 
child. Each member of the class was asked to guess whom 
these represented. With so large a number of raters, a pupil’s 
Score for selfishness is readily obtained from the number of 
times each sketch is assigned to him, and the scores show 
Satisfactory reliability. Here is an example from a scale used 
a a British research on emotional stability with 10 to 12 year 
olds 2: 


lways happy and enjoying them- 


These three people are a 
we ble to annoy them. They never 


selves. It is impossi 
change. 


Men a (2) I Be aa 


o are very changeable. You can 


H 
peda se ne: They are offended and 


never depend on them. 
annoyed very easily. 
(a) 


s le 
of tor Travers, R. M. W., ‘ A Critical Revi ity and Ran 
the Forced-Choice Technique ’. Psycho 'dity and Rationale 
; E., “Reply to Travers’ “ A Critical Review of the Validity a nr 194 
Of the Forced-Choice Technique ” ’- Psychol. Bull., 1951, reed ER 
EON work indicates that the most re ults are obt 
Pe aama eE Effect of Temperamental Traits upon HR 
Test Performance. Ph.D. Thesis, University of London, 1952. 


114 Personality Tests and Assessments 


6 : iques of 
Very similar are the well-known sociometric co pul He 
Moreno,! where each child writes the names of o ns 
would like to sit next to, or to play with, and so a struci 
are generally used to present a picture of the socia N a 
the group rather than to assess the popularity or EA BE 
individuals. During the war the anne aging the 
was often applied, particularly in en | ee Bid i: 
suitability of a recruit, in the eyes of his fe neophistieate 
mission. Among relatively uneducated or u Be 
adults this very simple form of rating is as abpropr ee 
among school children. Obviously such el opin the 
extremely biased judgments; the reputation i PTa by bis 
eyes of his class-mates may seldom accord with Taie Thus 
teachers. Nevertheless it may possess useful va IS ratings 
in MacArthur’s study of persistence (cf. p. 14), pap composite 
agreed rather more highly than teachers’ with SA nt 
results of persistence tests, And follow-up = a recruit’s 
Services indicate that the summed opinions o the recor 
fellows tend to have better predictive value than 
i nding officer. 2 Pay 
mendation of a ne poe Mee tise to the ratok ia 
of traits according to their Reo a dure 
s contrasted with the ordinary Pr way it 
individuals on a trait. In some for, say» 
her a person is most outstanding umerous 
Sociability, Instability, etc., than to arrange in the 
ing to degrees of Sociability. 


they can be sorted on 
10% most outstanding traits in X, whi 


or when none of the avai 
than a few individu 


how! 
als. Burt and Stephenson 3 have SM, 
how the results can 


Tes 

+ Monog 

“Moreno, J. L., ‘ Who Shall Survive?’ Nerv. Ment. Dis. 

1934, No. 58. 198%, 
2 Cf. Jenkins Bibliography. B hol., 37° 5, 
3 Burt, C. ne Correlations Between Persons’. Brit. J. Psychol a J: 

28, 59-96. Stephenson, Was 
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correlati . 
en between different persons (Stephenson’s ‘ Q- 
), as distinct from correlations between different 


t 2 
ests or trait-ratings (‘ R-technique ’). 


DEVICES FOR IMPROVING RATINGS 
of rati 7 en shown to increase the reliabilit, 
litte ise i.e. the agreement between different raters, neue 
eonsidered Ei regarding their effects on validity. We have 
ratings; (2) x ee : (1) the superiority of relative to absolute 
Possible ady St graphic to numerical or letter scales; (3) the 
(4). Choo antages of breaking down traits into components. 
define th se only straightforward, unambiguous traits and 
em concretely, avoiding as far 
Hollingworth and 


ivocal and un- 


equivo ; 
cal traits, though obviously much depends on how they 
n1 showed 


are d 
lefined. An interesting study by Stephenso: 

He 
f 100 children, and 


A number of steps have be 


factor. 

ments, ansed the correlations between the 1 

another out half the teachers agreed quite cl 

Closely > While the other half agreed with each other but less 
with the first group. Clearly there were two different 

ili eared to base 


aoe Clearly, better definition was called for. .. 
On one tr. en several traits are to be rated, i ndividuals 
pele swith: at a time, not each individual on all traits (unless 
ater a. n-persons technique). ‚This is sup 

(6). ¥ ependence between traits, ie. less halo. |. . 
Contain; Owever, if a rating form is being used for each individual, 
ng numerous traits or items, then the items should be 


on the fi relatively. desirable 2 
orm so that the dom fashion. Otherwise 


le ext ? 
© extremes alternate 1n ran } > 
Er liable to go down the page checking the desirable 
* Steph rable) answers or grades throughout. 
an Apple Analysis; with 
SE Ppli RA i Jnverted Factor AD: ysis, 
a to ae tion ataa ` J. Educ. Psychol., 1936, 27, 


e Tater 
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(7). Warn the raters of the nature of halo and encourage Be 
to avoid it. One way of eliminating it is to sum the ratings © 
each individual on all traits and to regard this as a measure 0 
halo or general popularity, which can then be subtracted from 
the separate trait-ratings, or held constant by partial correlation 
or factor analysis. Since, however, different traits of whic 
society approves certainly do overlap to some extent, this W! 
tend to over-correct. Nevertheless, in a research by the writer, 
this device did improve the validity of sets of ratings wa 
compared with trait-composites. An alternative would be 
get raters to estimate their personal liking for or dislike of eat 
ratee, in addition to assessing his traits, and to remove i 
influence of this from the total ratings. Though this greatly 
oversimplifies the nature of halo, it would be of some help 
Hartshorne and May, and Chi,1 suggest that halo differs among 
different judges, hence we could estimate the true overlappin& 
between traits by correlating A’s rating of Trait 1 with 
rating of Trait 2, and so on. The differences between suc 
correlations and the inter-correlations of A’s (or B’s) ratings 
all traits, would provide a measure of halo. The flaw in the 
argument is that A’s and B’s biases are only too likely to overlap» 
especially if they are both people with a similar relation to t t 
Tatees, say teachers. And even raters with such differen 
outlooks as teachers, parents, and pupils are liable to he 
similarly influenced by liking, or by common confusions abou 
the meanings of traits. In other words, the correlation betw a 
Traits 1 and 2, when rated by several judges, is made up °° 


genuine overlapping between the behaviour included in ws 
traits, plus confusions and biases common to ae 
any pair of, raters, 


It is reduced by confusio 


een 


«dual 
J ns and biases which affect individu® 
taters only. This is why, in Chap. I, little credence was attach? 
to factorial studies of ratings. They throw more light oD Jo 
linguistic problem of how people interpret traits than they 
on the structure or organization of personality traits as sue 
(8) Nevertheless it is certainly an advantage in any eXP a 
mental or practical application of ratings, to obtain independen 
judgments from 2 or more judges, since this helps to cancel © 


1 Chi, P. L., ‘ Statisti N ; : x Ë ; Expt" 
Educ., 1937, 5, a Analysis of Personality Ratings’. J 
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vida prejudices. There is little point in having more 
an 4 or 5, unless the rating scale is very coarse (as in voting 
among pupils). By inter-correlating the sets of judgments and 
applying the Spearman-Brown prophecy formula, it is easy to 

ded in order that their 


determine how many judges are nee 
ble level of reliability, 


pobined rating may reach an accepta 
ay -9. The writer would suggest, however, that diversity of 


mages is more important than number. The ideal would be to 
pave 2 or more of each type of judge, for example, 2 teachers, 
Se Bia Oe ee 2 relatives, and a group of pupils 
t o aim for high agreement between the judges within - 
ype, but lower correlations (representing different viewpoints, 


between types. 

(9). Raters should be trained in the use of the scale, and if 
they are required to use it frequently—for industrial, educa- 
tional or other purposes—their distributions, reliability, etc. 
should be checked periodically. Normally they can be assured 
that the ratees will not know what they say about them, so 
that they can be completely candid. But their attitudes to the 
person requiring the ratings should also be considered, and their 
full co-operation sought. In fact, much the same difficulties 
occur here as with self-ratings and personality questionnaires 
(cf. Chap. VIII). For example, raters are likely to be defensive 
about attributing undesirable traits to their friends. During 
the war it was noticed that instructor officers at a training 
school would willingly agree that some of their cadets were weak 
in efficiency, social adjustment, intelligence, etc. But similar 
officers in command of units to which these cadets were posted 
would give much higher ratings because, consciously Or 
unconsciously, they resented the suggestion that their own unit 


could contain any inefficient people. 

(10). It is usually stated that raters should have had plenty 
of opportunity to observe the kind of behaviour they are rating, 
and should be well acquainted with the ratees. Obviously there 
is some truth in this: an interviewer who merely talks to a 
prospective employee for half an hour can hardly assess his 
practical skills or the dependability of his character. But the 
tendency for an individual to express his personality in every- 
thing he does, also the tendency, for closer acquaintanceship to 
lead to more rigid and biased opinions and greater halo, should 
not be forgotten. Thus the evidence, SO far as it goes, actually 


9 
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suggests that an impartial observer and interviewer (particu a 
if he has applied performance tests or analogous E RA 
provoke significant behaviour) can give at least as re Be. of 
valid ratings as a close friend. Ferguson 1 compared ra} AER 
travelling salesmen by managers who were en. the 
them to varying degrees, and did obtain better ratings id be 
better acquainted; but all of them, presumably, pic fou 
fairly distant in their relations to the ratees. Se ia 
that a period of observation before rating auprored sd that 
bility of judgments to some extent. But Knight ° showed 
intimacy or length of acquaintanceship led to more BE ig 
and greater halo. Presumably a more superficial know boys’ 
also more detached. In Newcomb’s 4 research at Beri, 
summer camp, careful records were kept of actual be = STE 
and these were used for checking the accuracy of rater 

had obseryed some, but not other, kinds of behavionn im by 
validity of the ratings on observed traits was nepresent vn 
correlations of -54 and -45, while for traits which had pee un) 
observed but were inferred the correlations were -39 an 


istically 
The latter figures are somewhat lower, but to a statistic 
insignificant extent. 


JUDGING ABILITY 5 


ifferent 
It is dangerous to generalize about the goodness of differ 
types of raters, since th 


ings in Criterion 
* Ferguson, L. W., ¢ The Value of Acquaintance Ratings i 
Research’. Amer, Psychologi. 


290. aN. pl. 
? Slawson, J., ‘The Reliability of Judgment of Personal Traits’. J+ AP 
Psychol., 1922, 6, 161-171. 

3? Knight, F. B., ‘The Effect of 


» upon 
Personal Judgments’. J, Educ. P. 


the “ Acquaintance Factor 


sychol., 1928, 14, 129-142. 
“Op cit., p. 95. À yet un“ 
* An excellent review of this field is given by R, Taft in an as 
published thesi; iversi i 


ations 
is (University of California, 1950). The een, 
listed here are partly based on his intensive experimental investiga 


= Rati 
atings and Judgments of Personality KO 


are new acı i 

fairl quaintances or old friends? N 

ae ing, sociolly Ae ann en tal m 

En: ed. ams ? found th tth al 

ee to be extraverted; for the EL 

an be his own personality, but he may be i ate 
rs to judge them impartially.) The good ae 


tends al 
so t ini 
maturity, = be aber average in intelligence and in personalit 
egration. Though in some situations artiste 
stronger evidence that. 


inclinati 
aR seem helpful, there is 
d hey are in are superior to social science or arts students. 
judgments Be also to psychologists, except in so far as the 
Is no om ve knowledge of technical terminology. There 
generally m ‘or the belief in feminine superiority. Raters are 
and cultur er successful in judging people of similar age, Sex 
others ae wid background to themselves. Hollingworth and 
desirable hen some evidence that people who are high in a 
Or undesirabl tend to rate it better, and that the reverse holds 
at rating “s e traits. Thus the most ‘ snobbish ” are not good 
that a rater hobbies te Finally, the degree of confidence 
of their ee in his judgments is a very poor criterion 
N: . 
Several EA also in their judg-ability, i.e. the extent to which 
ratings of thet agree about them, and in the extent to which 
alo. The ome different traits are influenced by, oF free from, 
Ness ys, en is sometimes considered to show their open- 
igmatieness, the latter rity VS: indivi- 


duali . F 
ity. B nes their medioc! 
ut there is little evidence to confirm such suppositions. 


OTH 

ER METHODS OF EXPRESSING JUDGMENTS 
OF PERSONALITY 
ng useful ratings in education 
kely to know large numbers, 
eldom possible—we shoul 
thods, such as the 
h. The ordinary 


Tn vi 
or in, RT of the difficulties of getti 
and ee ee few raters are li 
enquire a thorough training is S 
estimonial the value of less formal me 
and the free personality skete 


racteristics of the Good Judge of 


Cf y 
an onality 7 P. E., ‘Some Cha 
Peyadams, H a Soc. Paychits 1938/4; a = 
ol., 1927, 22, Eme Goo Judge of Personality '- 


J. Abn, Soc. 
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structure that it reveals, 
it is not distorted by the 
the complex mass of evid 


candidates independently Prac 
exercises, finally assessing f -65 
heir mean inter-correlation 5 ot 
‘86 obtained by military © lec- 
sises. In the Civil Service ae 
the Psychologist’s final judgment © his 
lated as highly as -87 with the Final Board decision, an 

1Cf. Vernon, P. E., ‘ The 
Procedures ’, Occup. Psychol., 1950, 24, 75-95. 


ih 
a 
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validity coefficient with efficiency gradings after 2 years was “49. 
But these figures were no higher than those of the non- 
Psychological staff members. On the other hand, the studies of 
Vocational guidance conducted by the National Institute of 
Industrial Psychology (cf. p. 29) show that psychologists’ 
judgments based on a free synthesis of interview and other 
material possess good, though certainly not perfect, validity. 
The following conclusions seem to emerge regarding the best 
Way of obtaining useful information from associates about a 
candidate for employment, or from teachers about a pupil 
emng considered for promotion to secondary schooling, etc. 
A set of ratings should be asked for, probably in the graphic or 
questionnaire form described on pp. 107-9. The number of 
Questions, say ten to twenty with three answers each, should be 
ept as few as possible, consistent with covering most of the 
relevant aspects of personality. If the various questions can 
© validated and keyed against an external criterion of later 
auccess, in the manner of the Haggert -Olson-Wickman i 
or tet choice scales, so much the better. But the nn ee 
s bi ratings will usually be to force the judge to consider p 
op lect from as many angles, and as systematically an 
Woe as possible. Thereafter he should be aske ; 
e ite a free personality description, testimonial, or case-study, 
*plaining and commenting on his ratings, mentioning y 
tig evidence, and filling in what seem to him the ma 
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(d) Several persons besides the author judge the N 
of each item. An extension of this is the application o 
Thurstone attitude-scaling technique (cf. Chap. IX). ‚hen 

Most of the hundred or more tests that have been publis a 
are modifications or extensions of three prototypes : Woo 
worth’s Personal Data Sheet, Freyd-Heidbreder’s Inira 
Extraversion test, and Allport’s A-S (Ascendance Sultan 
Reaction Study. Instead of trying to give a comprener an 
list, we shall outline these, and mention briefly others, w. s 
have been widely used, or which embody special points 
technique. 


TESTS OF PSYCHONEUROTIC TENDENCY AND 
EMOTIONAL INSTABILITY 


Ae rived 
The 116 items in Woodworth’s 1 test were originally dee 
from medical psychologists’ descriptions of the symptoms 


3 à ; ive 
neurotic patients. The following are some representativ 
examples : 


Do you usually feel well and strong ? 

Do you ever walk in your sleep ? 

Have you ever had fits of dizziness ? 

Did you have a happy childhood ? 

Do you know of anybody who is trying to do you harm ? 
Does it make you uneasy to cross a bridge over a river ? 
Have you ever been afraid of going insane ? 

Have any of your family had a drug habit ? 


Each question is followed by ‘ Yes, No’, one of which e be 
be checked, Mathews, Cady,? and others have adapted the ‘oD 
for use with children, and Burt 3 publishes a British vera 
which, however, he recommends as an interview aid rather # B- 
as a quantitative test. Laird’s 4 Personal Inventory 


1 Woodworth, R. S., Personal Data Sheet, Chicago : Stoelting, a 
2 Mathews, E., ‘ A Study of Emotional Stability in Children ’. J. . Piy if 
1928, 8, 1-40. Cady, V. M., ‘ The Estimation of Juvenile Incorrigi) 
J. Deling. Monogr., 1923, No. 2. 1935- 
? Burt, C. L., The Subnormal Mind. Oxford University er chols 
“Laird, D. A., © Detecting Abnormal Behavior’, J. Abn. Soc. PSYC 


Š lican» 
1925, 20, 128-141. Inventories published by The Hamilton Repub 
Hamilton, N.J., 1925. 
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contains similar i 2 ae 3 
responses, ae: = items, but with multiple-choice (graphic) 


Have you (during 
ne Dat ger in. — 
peeks) beenafraid avoided accepted did not 
‘esponsibility ? it when forced mind it 
upon me 


liked welcomed 
it it 


The most widely used pre-war test, Thurstone’s ? Personality 
Woodworth, Laird, 


en contains 228 items collected from Wo 
stude: er sources. Percentile norms are available for college 
ita Many other shorter and simpler tests were devised 
who g the war, and used with some success IN screening recruits 
é et be liable to neurotic breakdown. These included 
hori ie Defence Research Couneil’s inventory (NDRC 
and th ormat), the Neuropsychiatric Screening Adjunct (NSA), 
Questi © Cornell Indew,* in America ; also the Maudsley Medical 
in Brit, nnaire,? and the Sutton Booklet or Bennett-Slater * test, 
items ain. The latter is a composite test, in ten sections, whose 
of Fe eleverly disguised. Three sections deal with symptoms 
questi, lety, hysteria, and depression ; but in about half the 
mea a negative, instead of & positive, response indicates 
good tie tendencies, so that the testee who wants to create a 
impression cannot merely check ‘ No ? throughout. Four 


Secti s, . + . 
ions contain lists of various types of annoyIng situations : 
‘t Somebody tells you 


(1) Frustration of self-assertion, ¢-8- 
(2) p how to do your job °. ; 
ersonal inadequacy, €g- © You forget what youre 
(8) p akng for’. ! 
(4 irt or untidiness, e.g. * An unmade bed 2 
Noise, e.g. ‘ The sound of hammering ’. 


Psychol one, L. L., and Thurstone, T- G» “A Neurotic Inven 
aM cee ty pane Test published by Univers 

we of Scienti 

‘lita cientific Research and *" National Defence 


TY Effici i 
‘ficiency. Washington, D.C tions paychology in Wo orld 
J.: Prince- 


1, Vol. 1 A f i 
iven IV. Measu t and Prediction. Princeton, N- ri 
Cornel gorsity Press surement der, A-, Mittelmann, Go et. al, * The 
Published lectee Index’. J. Amer. Med. Assoc. 44, 124, 224-228. 

N by Psychological Corporation, New York, 1948. 


tory’. J. Soc. 
Press, 1929. 


Discrimination of, 


4 Bena, Ysenck, Bibliograph: 
ett, y- 
Naoto fom Komma AEn? Brit J P BL 1945, 20, 271-282. 


om Normal Subjects’. Brit. J. 
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Evidence is given to show that neurotics check items of types @) 
and (4) as annoying much more often than normals do, wee 
(1) and (8) affect normals and neurotics alike. Thus scores Sai 
based, unknown to the testee, on the differences between the = 
sections. Finally, three sections are adapted from el: 
Cross-Out test (cf. P. 175). They contain lists of words w. 
the testee crosses out anything : 


(1) For which people should be blamed, e.g. ‘ Flirting, 

Speeding ’. s preo 
(2) Which he has worried about, e.g. ‘ Loneliness, F aling ; 
(8) In which he is interested, e.g. ‘ Football, Comedians - 


t 
Neuroties are likely to give many answers to (1) and (2), bu 
relatively few to (8). 


INTROVERSION AND ASCENDANCE TESTS 


Freyd-Heidbreder Test! Freyd collected fifty-four items 


descriptive of the introvert type from Jung’s writings, of which 
the following are samples : 


Blushes frequently ; is self-conscious. 
Day-dreams, 

Prefers to read a thing rather t 
Shrinks when facing a crisis, 


Is reticent and retiring; does not talk spontaneously. 
Is slow in movement, 


eeps in the background on social occasions. 


Heidbreder turned these into a self-rating test, where the ee 
checks each item +, ? or —. Laird’s 2 Personal Inventory ts 
and various other adaptations are available. Other tes 
such as Neymann-Kohlstedt’s and Root’s (described and bes 
in England by Wyatt and Langdon), were constructed a 
*Freyd, M., ‘ Introverts a 3 . Rev, 1924, 3y 
74-87. " Heidbreden Rie Test, RE 


+» * Measuring Introversion and Extroversio? ° 
J. Abn. Soc. Psychol., 1926, 21, 120— 84, 
2 Op cit. 


r 
? Neymann, C. A., and Kohlstedt, K. D., <A New Diagnostic Test fo 
Introversion-Extroversion m 


7. 
J. Abn. Soc. Psychol., 1929, 23, 2, 
Root, A. R., “A Short Test of Introversion-Extroversion °. Personne dom 
1931, 10, 250-253. Wyatt, S., and Langdon, J. N., ‘ Fatigue and Bone 
in Repetitive Work’, Indusir. Hlth, Res. Board Rep., No. 77. Lon 
H.M. Stat. Office, 1937. 


han experience it. 
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that items discriminated between schizophrenic and manic- 
depressive patients. The assumption that these psychotic 
groups represent the extremes of normal introversion and 
extraversion is very dubious; in fact, Eysenck offers an 
experimental disproof (cf. p. 36). Hence this type and the 
Freyd-Heidbreder type of test give very poor correlations with 


one another. 

A similar scale for schizothymia-cyclothymia, based on 
Kretschmer’s work, is published by Scholl.* 

Allport’s A-S Reaction Study. Here the items were made 
up to represent concrete manifestations of dominatingness 
(ascendance) or submissiveness, and were standardized by 
comparing the answers of students who had been rated by 
associations as highly ascendant or submissive. The following 
are examples; the numbers show the weighted scores for 


ascendance : 


A salesman takes manifest trouble to show Yes, asarule . .—lı 

you a quantity of merchandise ; you are Sometimes . g 0 

not entirely suited ; do you find it difficult No 4 7 . +1 

to say ‘No’? 

If you hold an opinion the reverse of that In class . . . +8 

which a leeturer has expressed in class, do After class 0 
Not at all . - —8 


you usually volunteer your opinion ? 


An alternative form is available for women, and an adaptation 


for children has been prepared. 


TESTS OF OTHER TRAITS 


A test for Inferiority Feelings, based on Adler’s writings, 
has been compiled by Heidbreder,? along the same lines 
as her introversion test. Bernreuter 4 published a test of 


1§choll, R., ‘ Untersuchungen über die teilinhaltliche Beachtung von 
Farbe und Form bei Erwachsenen und Kindern’. Zs. f. Psychol., 1927, 


101, 225-280. un 
2 Allport, G. W., ‘A Test for Ascendance-Submission °. J. Abn. oe 
Psychol., 1928, 28, 118-186. Allport, F. H., and Allport, G. W., 4- 


Reaction Study. Boston: Houghton Mifin, 1928. 
3 Heidbreder, E., ‘The Normal Inferiority Complex ’. J. Abn. Soc. 
par aah nt of Self-Sufficiency ’. J. Abn. 


4 Bernreuter, R. G., ‘ The Measureme 
‚Soc. Psychol., 1988, 28, 291-300. 
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1 has 
Self-sufficiency vs. Dependence on Others. Er. 5 
developed tests of Security-Insecurity and of Se is ET 
Dominance feeling (the latter for women only), on Te 
clinical studies of well and poorly adjusted ee, Bm 
and Chant and Myers ? have tests of Depression-Elation, 


i i ange 
latter being scaled by Thurstone’s technique. Its items rang 
from : 


A to, 
Everything in the world is against me (Score 0-9) to, 
Life could not be better for me (10-7). 


s ins 
Willoughby’s E-M (Emotional Maturity) Scale® contal 
similarly standardized items, SER 


S develops affective difficulty in the presence 
of a necessity for precise and realistic Score 2) 
thinking, e.g. mathematics, —— (Se 
S organizes and orders his efforts in pur- 
suing his objectives, evidently regard- 
ing systematic method as a means of 
achieving them. ae 
This is intended primar: ‘or 
ratings of a patient by a psychiatrist, but can also be used fi 
self-rating (at a sophisticated level), 
Wang’s 4 
opinion of 75 judges, sho 
non-persistent person, 


ring 
1 Maslow, A. H. Clinically Derived Test for ren 
Psychological Security-Insecurity’. J. Gen, Psychol., 1945, 33, College 
Maslow, A. H., ‘A Test for Dominance-Feeling (Self-Esteem) 7 awen 

+, 1940, 12, 255-270. Social Personality yey 
anford, Cal.: Stanford University Press, i 


A d its 
+» “The Measurement of Depression-Elation an! 


chol-s 
of Extraversion-Introversion. J. Abn. Soc. Psy 
1980, 25, 807-818. Chant, S. 


h 
Rag ond Myers, C. R., * An Approach g 
the Measurement of Mental Health; Amer. J. Orthopsychiat., 
184-140, Psychol» 
* Willoughby, R. R., ‘ A Scale of Emotional Maturity’. J. Soc. 091, now 
3082, 8, 8-86. Test published by Stanfors University Press, 1081, 
out of print. Psychol» 
“Wang, C. K. A, A Scale for Measuring Persistence’. J. Soc. 
1982, 3, 79-90, 
5 Cason, H., ‘An 


, 
Jems * 
Annoyance Test and Some Research Prob 
J. Abn. Soc. Psychol., 1980, 25, 224-286. 
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test (adapted for the Bennett-Slater questionnaire, p. 125). It 
lists 217 situations which the testee rates from 3 (extremely 
annoying) to 0 (not annoying). The average score can be used 
as measure of Irritability. 

Wallen’s 1 test of Food Aversions consists of twenty foods 
which are ticked for liking or disliking. Normal adults dislike 
an average of one or less, whereas neurotics average three to 
five aversions. Eysenck has found the test effective in this 


country also. 


MULTIPLE TESTS 


Tests such as Woodworth’s or Thurstone’s obviously contain 
a wide range of symptoms drawn from many different neurotic 
or psychotic conditions. It would be theoretically possible for 
several testees to give neurotic answers to entirely different 
sets of, say, twenty items, and yet get the same score and be 
labelled equally neurotic or unstable. Attempts have been 
made to classify the Thurstone Schedule items,? e.g. under 
Extravert-Introvert, Physical Disorders, Fantasy, etc., but 
these tend to inter-correlate too highly to be accepted as 
distinct. In Laird’s B-2 Inventory the items are classified as 
Psychasthenoid, Schizophrenoid, and Neurasthenoid. 

Cattell ® published a questionnaire with separate sets of items 
for seven syndromes—Neurasthenia, Anxiety Neurosis, Anxiety 
Hysteria, Conversion Hysteria, Obsessive-Compulsive, Epilep- 
toid, and Paranoid. Better known is Hathaway and McKinley’s 
Minnesota Multiphasic Personality Inventory (MMPI),* which 
is widely used in mental hospitals in this country as well as 
America. Its 550 statements are more varied than usual, 
including some dealing with interests and social attitudes. 
They are generally presented individually, on separate cards, 
1 Wallen, R., ‘ Food Aversions of Normal and Neurotic Males’. J. Abn. 


Soc. Psychol., 1945, 40, 77-81. ; 
i u AR“ e Properties of the Thurstone Personality 


1 Cf, Willoughby, R. R., ‘Som 
EV, cated Revision’. J. Soc. Psychol, 1982, 8, 401424. 


ee and London : University of 


3 Cattell, R. B., A Guide to Mental Testing. 
London Press, 1936. f 3 i 

‘Hathaway, S. R., and McKinley, J. C., ‘A Multiphasic Personality 
Schedule (Minnesota). I. Construction of the Schedule. J. Psychol., 
1940, 10, 249-254. Inventory published by Psychological Corporation, 
New York, 1942. 
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’ 
and sorted by the patients into ‘ True, False, and Cannot Say 
boxes. It may take anywhere from 80 minutes to several hous 
to complete. On the basis of the responses of 500 norma 
adults (16 to 55 years) and 800 miscellaneous patients, a series 


of empirical scoring keys has been developed, so that a profile 
is obtained showing relative scores on: HypochondriasiS, 
Depression, Hysteria, Psychopathic Deviate, Masculinity- 
Feminity, Paranoia, Psychasthenia, Schizophrenia, and Hypo 
mania. Note that the significance attached to an item depends, 
not on its manifest content, but on its correlation with an 
external criterion. Other keys can be and are being develope 

by various authors, e.g. for differentiating academically unsuc- 
cessful from successful students, Four additional scores 
provide checks on self over-evaluation, malingering, and other 
Sources of unreliability. The authors do not claim that the 
profile will provide an automatic differential diagnosis. = 
patients. The MMPI is a clinical instrument which require 
considerable skill in interpretation, and even then is likely 2 
agree with the psychiatric diagnosis only in 60% of cos 
(according to critics the figure is less than 50%). It is mor 
successful in differentiating abnormal persons in general from 
normal. While experienced 


detailed insight into a personality from the pattern of ee 


Another test with a Psychiatrie background is the Hunt 
Wadsworth Temperament Scale.? Tt aims to measure Sa 

components ° distinguished by Rosanoff : Normal, Hyster’: f 
Manic Cycloid, Depressive Cyeloid, Autistic Schizoid, Parano: of 
and Epileptoid. Scoring keys were based on the responses ng 
groups of patients, criminals, and normals known to be stro re 
or weak on these components. Some of the 318 items ars 
for more than one component, while others are not score S5 
all, since they did not differentiate between any of the gt pee 
but they were left in the final form of the test for the ote 
Teason that the value of an item may be affected by its conte 


: tof 
„ Cf. Gough, H. G., ‘ Factors Relating to the Academic Achieveme? 
High-School Students e J. Educ. Psychol., 1949, 40, 65-78. worth 
umm, D. G., and Wadsworth, G. W. “The Humm-Wads Scale 
Temperament Scale’. Amer. J. Psychiat., 1935, 92, 163-200- 
published by Humm Personnel Service, Los Angeles, Cal., 1940. 
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This has the disadvantage of making the test rat! 
average time for answering being 55 minutes. 
esting feature is that the proportions of Yes’s and No’s to the 
test as a whole provides a check on the testee’s conscientiousness, 
Negativistic persons tend to give an undue number of No’s, 
highly suggestible people too many Yes’s. Humm insists that 
it is the pattern or profile of scores on all the components, 
considered in the light of these checks, which enable the well- 


weakness among employees. Other 
only moderate or poor validities,1 


achieved enormous 
tion. It claims to 


and Allport, etc.) is scored for each trait. 


For example the 
responses to: ‘Do you day-dream frequ 


ently ?’ are scored : 


Neurotic Introversion Dominance 


Self-Sufficiency 
Yes +5 +3 =I +1 
No > s —4 —4 +1 —1 
Doubtful $ —2 0 +2 +2 


obtained empirically by contrasting the 
responses of high and low scorers on four established scales— 


urstone’s, Laird’s C-2, Allport’s, and Bernreuter’s own Self- 
ufficiency test. In this instance the external criteria for item- 


ible, the resulting inventory scores 
Show odd fi i 


» correlating 
nearly the reverse of both, its 


i 4: Appl: Psychol., 1944, 28, 302-307. Guilford, J. P., and 
acey, J, I., Printed Classification Tests. Army Air Forces Aviat, Psychol, 
- Rep. No. 5. Washington, D.C.: U.S. Government Printing 
Office, 1947. 


ern E z 
Invento R. G., ‘The Theory and Construction of the Personalit 


+ J. Soc. Ps chol., 1988, 4, 887-405. Invento; ublished b 
Stanford University En 1981 7 
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correlations being —-81 and — 67. Self-Sufficiency is relatively 
distinct, though overlapping moderately with Dominance ine 1 
correlations are — 41, —-32, and +-58. Flanagan? app on 
factor analysis to these scores and showed that the test Bann 
effect measuring two, not four things. The first, a compou a 
of Neurotic, Introverted and low Dominant, and Self-Sufficie 
scores, seems to represent general Lack of peli -Confidence a) 
second and smaller factor may be denoted as Sociabili t 
Flanagan constructed a fresh set of keys so that responses cou 
be scored for these two factors. y dat 

Bell's Adjustment Inventory * is another test widely use ie 
high school and college level. Its 160 items are grouped un 
four headings—Home, Health, Social, and Emotional Adjus 
ment. ite 

Boyd’s Personality Questionnaire 3 is the only one to ha 
been used at all extensively among British university students. 
Its 120 items are classified under twenty headings or traits, 
including the following : 


Trait Sample Question h 
Obsessional Carefulness Do you often go over a job again and again 
to make it just right ? : 
Worry, Anxiety Do you brood long over humiliating OY 


unhappy experiences ? 
Do you sometimes suspect that people are 
talking about you ? . 
Self-consciousness Are you greatly interested in what goes on 1n 
your own mind ? 


Suspiciousness 


Testees are not told about these traits, and the questions 
are so arranged that they are unlikely to guess that cs 
deal with carefulness, six with worry, ete. Each question T 
answered Yes, Yes ?, 0, No? or No, or omitted; es 
scored 4 to 0. Thus there is a possible range of 24 to ra x 
each trait. Naturally the traits overlap considerably, an A 
factor analysis by the writer of the scores of 100 studen 


1 Flanagan, J. C., Factor Analysis in the Study of Personality. Stanford, 
Cal. : Stanford University Press, 1935. iversity 

2 Bell, H. M., Adjustment Inventory. Stanford, Cal. : Stanford Univ: 
Press, 1934. sept- 

? Boyd, W., ‘A New Personality Test’. Scot. Educ. J., 1939, 
lst-15th, 998-999, 1014-1016, 1024-1025. 
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indicated that they could be boiled down to three or four 
distinct tendencies : 


(1) Self-depreciatory and psychoneurotie tendency—a general 
factor particularly strong in the scores for Depression, In- 
stability, Worry, Lack of Self-Control, Shrinking Responsibility, 
and Lack of Confidence. 

(2) ‘ Care-freeness ’, most marked in Shrinking Responsibility, 
Suggestibility, Inability to Concentrate, Lack of Definite 
Interests, and in low scores on Worry, Self-Consciousness, 
Emotional Thinking, Dissociation, and Tenseness. 

(8) ‘ Serupulousness ’, most marked in Obsessional Careful- 
ness, Acting Readily without Pressure, Suspiciousness, Self- 
Control, and Low Instability, Emotional Thinking, and 
Inability to Concentrate, 

(4) A sex difference factor. 


These results indicate the dangers of taking questions at 
their face value. Even when multiple tests are devised to 
measure traits distinguished by factor analysis, the resulting 
scores tend to be far from distinct. An example of this is 
provided by the: 

Guilford-Martin Temperament Profile Chart! In several 
publications Guilford has analysed correlations between 
typical items from extraversion-introversion tests, and claimed 
to break down this trait into separate components. Two of his 
tests attempt to measure such factors, namely : 


Inventory of Factors STDCR Inventory of Factors GAMIN 
Social Introversion General Pressure for Overt Activity 
Thinking Introversion Ascendancy in Social Situations 
Depression Masculinity of Attitudes and Interests 
Cycloid Disposition Inferiority Feelings 

Rhathymia (Carefreeness) Nervous Tenseness and Instability 


Actually the correlations between these sub-scores are so 
high that, in the absence of validation against external criteria, 
they all seem to measure much the same introversion-neurotieism 


1 Guilford, J. P., and Guilford, R. B., ‘ Personality Factors S, E, and M, 
and their Measurement’. J. Psychol., 1936, 2, 109-127. Martin, H. G., 
‘The Construction of the Guilford-Martin Inventory of Factors 
G-A-M-LN’. J. Appl. Psychol., 1945, 29, 298-800. The three inventories 
published by Sheridan Supply Co., Beverly Hills, Cal., 1940-1943. 

IO R 
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è i d 
factor as does the Bernreuter Inventory.! A third test ae, 
in Guilford’s Profile Chart is his Personality SEE. in 
aims to pick out trouble-makers and paranoid indivi Pa 
business and industry by means of three groups ven 
measuring : Objectivity, Agreeableness, and Co-operati a at 
It has been shown to correlate moderately with mannen A 
employees,? but whether it would work equally en 
selection test, when testees are on the defensive, is not oa 

Cattell’s 16 P.F. Test. Perhaps the most ambitious ques uve 
naire is the one constructed by Cattell to cover the in 
“ source traits ’ of his personality factor analysis, together ven 
four additional traits (radical-conservative, selten 
will-control, and nervous tension). There are two Plage 
forms, each containing 187 items. They are designed for co Te 
student level, but a children’s edition is being prepared. va 
amount of overlapping among the sixteen scores is not sta 
and no evidence of validity is so far available. 


QUESTIONNAIRES FOR CHILDREN 


ucted by trained psychologists. a 
pectiveness, contrasuggestibility thoes 
€s, some American rend. 

-person questions, similar to those of a 
ho ratings. 


one ere 
Maller’s Character Sketches 4 contains 200 short descriptions + 


i the 
the testee has to say whether or not he feels or acts like 
person described. For example : 


. to 
—— This person insists on having his own way and likes 
. command and rule everybody. nt 
—— This person finds it difficult to forget unpleasa 
memories and can’t help thinking about them. 


ity 

1 Cf. Lovell, C., ‘ A Study of the Factor Structure of Thirteen Personali 

Variables’, Educ. Psychol. Measmt., 1945, 5, 335-850. 
aCe Dorcus, op. cil., p. 131. à ire. Cham- 
3 Cattell, R. B., et. al., The 16 Personality Factor Questionnaire. 

paign, Ill. : Institute of Personality and Ability Testing, 1950. ‚nes, 1936- 
4 Maller, J. B., Character Sketches, 1932; Personality Skel eations- 

New York: Teachers College, Columbia University, Bureau of Pu 
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Each item is repeated in reverse form, elsewhere in the test, as 
a check on consistency ; e.g. : 


This person never insists on having his own way and 
does not like to command and rule everybody. 


All questions have been shown to differentiate significantly 
between groups of 308 delinquent or problem cases and 310 
normal pupils. They are classified under the following six 
headings, which are admitted to overlap to a moderate extent : 


Personal adjustment (freedom from 
anxiety) 

Mental health (freedom from psycho- 
tic or neurotic symptoms) 

Social adjustment (extraversion) Readiness to confide in others 


Desirable character traits 


Self-control and integration 


A later edition, Personality Sketches, consists of 100 items, 
presented on cards, so that no written response is required. 
Pintner’s Aspects of Personality * contains thirty-five items 
for measuring each of three traits—ascendance-submission, 
extraversion-introversion, and emotionality. It is sufficiently 
simply worded to be applicable from about 10 to 15 years. 
Sanders,? in Australia, has developed a test for individual 
o 9- to 13-year-old boys, containing third-person 
h Physical and Economic Insecurity, Social 
and Non-Social Tendencies. The two latter 
sections are highly correlated. He finds that delinquents tend 
to give high scores on all sections, and that anxious or neurotic 


boys surpass normals on the second and third. 

Several other inventories and questionnaires, either for 
American adolescents or for the long-suffering college student, 
are listed and critically reviewed in Buros’s Year books. 


application t 
items dealing wit! 
Under-evaluation, 


BIOGRAPHICAL INVENTORIES 


the 1920s, studied the predictive value of 


Hollingworth, in 
tion blanks by candidates for employment, 


responses to applica’ 


1 Pintner, R., and Forlano, G., ‘ Validation of Personality Tests by 
Outstanding Characteristics of Pupils’. J. Educ. Psychol., 1939, 80, 
25-82. Test published by World Book Co., Yonkers, NY; 1938. f > 
2 Sanders, C., ‘ Insecurity and Social Maladjustment in Children °. 


Brit. J. Educ. Psychol., 1948, 18, 148-155. 
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and found that a weighted score derived from the most on 
questions might correlate well with subsequent success. De 
validation necessitates very large numbers, thus this techniq = 
was useful in selecting pilots, officers, or other large groups i 
the Second World War.1 Inventories were devised conta £ 
a hundred or more multiple-choice questions which Coran a 
wide range of mainly factual information—educational Sae 
occupational career, financial status, skills and trade ER el 
home background, marital record, athletic and leisure semai by 
health, ete. The relevance of each answer was determine ‘lin 
giving the inventory to groups of, say, successful and oe 
Pilots, and a purely empirical scoring key was developed. Fie 
same inventory, with a different key, could be used for naviga ae 
or other groups. Note that this type of questionnaire does n! 
set out to measure i ied 
certainly the most useful of all the personality measures Hha 
out in the U.S. Army Air Force. Unfortunately it is applica 


work only with adults of good intelligence. It might well be 
tried in the selection of university students. 


DISCUSSION OF PAPER-AND-PENCIL 
PERSONALITY TESTS 


The weaknesses of the questionnaires described above HR, 
So obvious that their use in countries other than America ee 
een confined almost entirely to a few tentative ee a 
investigations. But in America they have not only been ha at 
in hundreds of studies with pupils and students, but are is Be 
least used to be) applied regularly in mental hospitals, c a for 
and schools, for detecting problem cases and neurotics, an ade 
educational or vocational guidance. They are so easily m 


iplio- 
*Cf. Guilford and Lacey, op. cit, p. 181. Vernon and Parry, TE n 
graphy. Stuit, D. B. (edit.), Personnel Research and Test Deve lop’ i 


Pi ivers!' 
the Bureau of Naval Personnel. Princeton, N.J.: Princeton Univ: 
Press, 1947. 
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Ellis, there are signs of greater caution, and of some realization 
of the importance of the testees’ attitudes to the tests. 

The majority of the questions deal with personal matters 
which one might discuss with a sympathetic and trusted friend, 
or a psychoanalyst, but would certainly hesitate to commit to 
writing for some relatively unknown tester to read. Many 
experiments have in fact shown that when people do not have 
to give their names they admit to larger numbers of symptoms 
of maladjustment. Interesting studies by Smith, and Sletto,? 
compared items set in a positive or socially acceptable form with 
similar ones in negative form, e.g. : 


Feels people speak well of him and like him. 
Feels people criticize him and dislike him. 


It was found, (a) that more people admitted that the positive 
form did not apply than that the negative form did apply ; 
(b) that the correlation between scores on the two types was 
not high; (c) that the negative type showed higher internal 
consistency. Presumably this means that negative items all 
tend to arouse a suspicious, hostile, or defensive attitude. 
Hence testees answer them all alike, regardless of their real 
meaning. Positive items, however, are considered more 
calmly, and so evoke a greater diversity of response and lower 
consistency. Thus the common application of the internal 
consistency technique of item analysis tends to produce a 
piling up of negative items.* 

Many tests, such as Allport’s A-S, try to draw on recollections 
of objective behaviour rather than on feelings, and it has been 
shown that answers to such items tend to be more stable or less 
likely to alter if the test is repeated (although their internal 
consistency may be lower). But affective reactions naturally 
enter into these just as, in the case of external ratings, halo 
continues to operate however objectively the traits are defined. 
Although the name of the trait or traits at which a test is aimed 


1 Cf. Ellis, Bibliography. 

2 Smith, R. B., ‘ The Development of an Inventory for the Measurement 
of Inferiority Feelings at the High School Level’. Arch. Psychol., 1982, 
22, No. 144, Sletto, R. F., ‘A Critical Study of the Criterion of Internal 


Consistency in Personality Scale Construction’. Amer. Sociol, Rev., 1936, 


1, 61-68. ; 
R. R., and Morse, M. E., ‘ Spontaneous Reactions to 


s Cf. Willoughby, p 
a Personality Inventory’. Amer. J. Orthopsychiat., 1986, 6, 562-575. 
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is usually withheld, and some non-committal title like Inventory 
or A-S Study is given, it is obvious that testees will make their 
own guesses as to the object of the test, and will answer each 
question not so much at its face value as in accordance with 
their own interpretation of the object, and with how much 
they are willing to reveal. 6 

‘This explains why, on the one hand, different tests of nomin- 
ally the same trait, given on different occasions, often show 
remarkably poor agreement, whereas on the other hand tests of 
nominally different traits given in the same experiment (par- 
ticularly if combined together into a multiple test) tend to 
show much too high correlations. In fifty-five studies listed by 
Ellis, the median correlation between tests claiming to measure 
the same trait was -40. The present writer found a figure not 
much lower, namely -84, in fifty-eight studies where introversion, 
psychoneurotic, tendency and submission-ascendance were 
inter-correlated. The far higher overlapping in the Bernreuter 
and other multiple tests has already been pointed out.! This 
strong general factor which runs through the responses to 
extremely diverse questions in any one testing situation surely 
reflects the testee’s halo and general attitude to this situation. 

It is not only the subject’s willingness to co-operate that 
affects his responses, but also his unconscious resistances: 
People literally do not know themselves well enough to answer 
many of the questions correctly ; their responses are only too 
likely to be rationalizations or unwitting self-deceptions- 
Alexander 2 has compared questionnaires with psychoanalytic 
techniques, pointing out that the psychoanalyst would never 
expect to get valid information from direct questions or intro- 
spections, where conscious criticism is at its maximum. Another 
important factor is suggestion. The well-known experiments 
on testimony show how liable to falsification are our recollections 
of emotionally toned experiences, and how easily suggestive 
questioning may lead us to accept experiences as our own whic 
never really occurred. Thus while most testees may be expected 

1 A relevant finding by the Research Branch, Information and Education 
Division of the U.S. Army, was that its NSA test of 15 items dealing, yt 
psychosomatic symptoms was as effective in screening neurotic recruits er 
a multiple test of over 100 items, which were carefully compiled to cov“ 
all the main aspects of maladjustment. 


in 
* Alexander, F., ‘Evaluation of Statistical and Analytical Method) g, 
Psychiatry and Psychology’. Amer. J. Orthopsychiat., 1934, 4, 48 


Self-Ratings and Personality Questionnaires 189 


wittingly or unwittingly, to disguise their emotional weaknesses 
in answering personality questionnaires, others of a more sug- 
gestible type may greatly exaggerate. This was brought out in 
a study by Hollingworth,! who applied the Woodworth inventory 
to groups of soldiers in a mental hospital shortly before the 
1918 armistice, and to other similar groups shortly after. The 
average incidence of neurotic symptoms was about twice as 
great in the former, presumably on account of their conscious 
or unconscious fear of being returned to active service. 

Yet another factor appears to raise scores in the well- 
educated and academically minded. It is a remarkable fact 
that university students and professional people obtain much 
higher average psychoneurotic and introverted scores than do 
the relatively uncultured ; not infrequently they are found to 
be as unstable as neurotic and psychotic mental hospital 
patients. (There is a slight tendency also for the better students 
to be more introverted and neurotic, though the evidence on 
this point is somewhat contradictory.) Conceivably such 
persons are more neurotic than the less educated classes, but 
it is just as likely that they are also more self-analytic, more 
used to verbalizing their emotional experiences, more willing 
to admit to themselves and to the tester the possession of the 
symptoms which the tests describe. Thus the explanation of 
the high consistency and overlapping of questionnaire tests 
probably lies in these various distorting attitude factors. High 
scorers are not necessarily the most neurotic, introverted, 
submissive, lacking in confidence, nor given to fantasy, shrinking 
responsibility, instability, and inferiority feelings; they may 
be the more sophisticated and introspective, or the more 
suggestible, or the more willing to co-operate with the tester. 

The influence of temporary mood—optimism, worry, ete.— 
might also be thought to affect test responses. An experiment 
with the Bernreuter test by Johnson 2 showed that this does 
occur, though only to a very slight extent. Objections are often 
raised, again, to the rigid limitation of testees’ responses to Yes, 
No, or at most to five steps for each item. It is true that this 
worries many educated testees, whose natural reactions to the 

1 Hollingworth, H. L., The Psychology of Functional Neuroses. New 
York : Appleton, 1920. 

2 Johnson, W. B., ‘ Th 
by Bernreuter’. J. Soc. Psych 


e Effect of Mood on Personality Traits as Measured 
ol., 1984, 5, 515-522. 
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questions are infinitely varied. Eisenberg! carried out an 
introspective study of what different subjects meant when they 
selected a given response to a typical question, and showed that 
there were enormous variations. However, this is less serious 
than it sounds since variations in interpretation will tend to be 
random, and to cancel one another out when total scores are 
considered. It is when all the variations take the same 
direction, as when the ‘ tough-minded ’ or the resentful testee 
underestimates his instability throughout, that they upset the 
test results. 

A much less obvious distortion enters whenever testees are 
allowed to omit questions, to answer by question mark, or to 
give various grades of response. Some people are much more 
cautious or non-committal than others. For example, in the 
Boyd questionnaire it was found by the writer that the propor- 
tions of extreme responses (definite Yes’s and No’s as contrasted 
with Yes ?, 0, or No ?) ranged from 17% to 98%. Cronbach 
calls this factor “response-set” and shows that it is fairly 
persistent or general from one test to another. (It occurs also, 
of course, in associates’ ratings and in the markings of examina- 
tions, where some markers adopt a much wider spread than 
others.) In ability tests, as we have seen, it may give some 
indication of impulsiveness-caution, but its psychological 
significance in personality questionnaires is very dubious. KOE 
the most part it is a quite irrelevant feature of the testees 
interpretations of the test instructions, which nevertheless 
operates to raise all high, or lower all low scores. Such tests as 
MMPI and Humm-Wadsworth try to provide checks, an 
recognize that abnormal sets may invalidate all test scores. 


Cronbach recommends the use of forced-choice items in order 
to eliminate it. 


VALIDITY 


However, the constructors of questionnaires can justifiably 
answer that speculations about testees’ attitudes and interpr 
tations do not concern them, provided that the tests work: 


= Eisenberg, P., ‘ Individual Interpretation of Psychoneurotic Inventory 
Items’. J. Gen. Psychol., 1941, 25, 19-40. Test 

? Cronbach, L. J., ‘Further Evidence on Response Sets and 
Design’. Educ. Psychol. Measmt., 1950, 10, 3-81. 


ee 
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The important thing is whether they correlate with other 
evidence of the traits. There is no need for us to survey 
validation studies in detail, since this has been done by Ellis. 
He outlines the results of 880 relevant investigations and 
concludes that, with two exceptions, these are generally 
unfavourable to personality questionnaires. Perhaps his 
standards of ‘ favourable’ are unduly high. Thus he regards a 
correlation of -4 as only ‘ questionably positive’. We would 
agree that this level of validity is too low for making predictions 
about individuals, but it does indicate that questionnaires 
have some value, especially if combined with other kinds of 
tests. In just over half of the following 217 investigations the 
results were positive or questionably positive : 


Comparisons of the scores of behaviour prob- 
lem children and of delinquents with 
normal children : 

Comparisons of neurotic and psychotic 
adults with normals : (45 out of 75) 

Correlations with ratings by associates on the 
traits at which the tests are aimed : (22 out of 44) 


Correlations with other tests of the same 
traits : (27 out of 55) 


(24 out of 48) 


The fourth category consists entirely of comparisons with other 
questionnaires ; thus it indicates weak reliability rather than 
moderate validity. However, in the present writer’s research, 
the mean correlation of five questionnaires with trait composites 
(chiefly made up of ratings and objective tests) was -45. 

More striking than the average result is the wide range of 
success and failure of questionnaires, even in similar investiga- 
tions. This does not seem to be mainly due to differences in the 
value of different questionnaires, although Ellis does show that 
the Bell and Bernreuter Inventories, the Thurstone Schedule, 
and Woodworth questionnaire tend to be the least successful. 
More probably it is a matter of the subjects’ attitudes. Two 
studies of the Thurstone Schedule among college students 
provide an interesting contrast. Hanna 1 gave it to 179 students 
who applied for psychological or vocational guidance at a 
college clinic, who, therefore, presumably thought that it 

1 Hanna, J. V., ‘ Clinical Procedure as a Method of Validating a Measure 
of Psychoneurotic Tendency’. J. Abn. Soc. Psychol., 1934, 28, 435-445. 
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would help them if they answered it really frankly. Independent 
estimates of their emotional stability were made by clinical 
psychologists and good agreement was found with the Schedule 
scores (corresponding to a correlation of over 5). Moran,! on 
the other hand, showed that 41 students who were classified 
as maladjusted scored scarcely any higher than 146 well- 
adjusted students. But here the Schedule (in abridged form) 
was taken along with various tests of abilities at the beginning 
of the college year, so that many students may have thought 
that the authorities would be influenced by the kind of picture 
they drew of themselves. 

Two exceptions were mentioned to Ellis’s general condemna- 
tion. The first is the Minnesota Multiphasic Inventory. 
considerable majority of studies have shown significant differ- 
ences between various abnormal groups and normals, though 
the validity of the test for differentiating among different 
abnormal types is less well attested. Usually, of course, the 
test is administered individually, and patients are more likely 
to answer as they would to oral questioning by a psychiatrist. 
Secondly, there is no doubt that such inventories as the NDRC, 
NSA, Cornell and others were of value in screening abnormal 
recruits during the war. The great majority of seventy-one 
studies showed significant agreement with subsequent psychia- 
tric diagnoses. Harris’s 2 study of 2081 naval recruits who 
answered NDRC and Cornell (in about 6 minutes) is repre- 
sentative. Of these 297, or 143%, scored above a certain 
borderline, and 52 of them were discharged after psychiatric 
interview or as a result of referrals during 10 weeks’ training ; 
another 16, not ‘caught’ by the tests were also discharged. 
Thus three-quarters of the discharges were diagnosed, an 
11-7% of acceptable recruits were incorrectly picked out. 
Similar tests were applied experimentally, but not regularly; 
in the British Services. On the whole they were less successful. 
Thus Eysenck found some correlation between the Maudsley 
Medical Questionnaire and neuroticism among recruits, but 
this was smaller than that of the body-sway suggestibility, leg 
persistence, dark vision, and other objective tests. 


1 Moran, T. F., ‘ A Brief Study of the Validity of a Neurotic Inventory "- 
J. Appl. Psychol., 1935, 19, 180-188. r 

? Harris, D. H., ‘ Questionnaire and Interview in Neuropsychiatric 
Screening’. J. Appl. Psychol., 1946, 30, 644-648. 
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Ellis and Conrad point out that such tests always produced 
large numbers of ‘false positives’, i.e. normals with high 
neuroticism scores. Nevertheless, they did save a considerable 
amount of psychiatric interviewing time by picking out most 
of the potential neurotics. Much of their good validity could 
be attributed to ‘ criterion contamination ’, that is, to the fact 
that they asked questions similar to those asked orally in a 
psychiatric interview; indeed often the psychiatrists would 
know, and be influenced by, the test scores. Thus the results 
were far less favourable in thirty-six studies where tests were 
checked against objective criteria such as failure in training. 
For example, several tests tried out on USAAF pilots gave very 
small or zero correlations with learning to fly. No satisfactory 
investigations of their value in predicting breakdown in combat 
appear to have been carried out. 

In so far as questionnaires do work better in military than 
civilian contexts, it is probably due to the tremendous hetero- 
geneity of samples of recruits, and to better motivation. 
Recruits may be more candid, either because of military 
discipline, or because they may assume that scores indicating 
abnormality will be to their advantage. Further, the better 
tests were carefully constructed from items each of which had 
been proved to differentiate abnormals from normals, in contrast 
to the haphazard collection of items in most civilian tests, 
described at the beginning of this chapter, plus the misleading 
internal consistency check.! It is noteworthy that the civilian 
tests which on the whole give the best results such as Allport 
A-S, MMPI, Humm-Wadsworth, and Character Sketches, also 
consist of empirically validated items. 

AW e may conclude then that, despite their extreme weaknesses 
and dangers, paper-and-pencil personality tests and question- 
naires should not be entirely condemned. Well-constructed 
ones, given under suitable motivating conditions, can be of 
value both for experimental research, and in clinical or other 

j 


applied psychological work; 


1 Cf. the useful discussion by Stuit, op. cit., P- 136. 


IX 
Measurement of Attitudes and Interests 


DE techniques which, as we have seen, work rather badly 

in the measurement of emotional traits, are consider- 
ably more successful in measuring social attitudes, opinions, 
and interests; for example, radicalism vs. conservatism, 
nationalism, favourableness or unfavourableness to religion, 
to birth control, or to coeducation, liking for particular school 
subjects or occupations, ete. The term ‘attitude’ has been 
used by psychologists in a great many senses, and there 18 
no agreed definition.1 But in this context it generally implies 
a personality disposition or drive which determines behaviour 
towards, or opinions and beliefs about, a certain type of 
person, object, situation, institution or concept. It includes 
both McDougall’s ‘sentiments’ and the medical psychologist’s 
“ complexes ’, though it is not necessarily thought to arlse 
either from innate instinets or from repressed wishes. Often 
our attitudes are adopted ready-made, as it were, from our 
parents, teachers, or friends, though usually modified by our 
own experiences, 

Note that, as in the case of personality traits, attitudes may 
be expressed either through behaviour or through verba 
statements, and that not infrequently these may seem to be 
inconsistent or contradictory. Katz and Allport ? point out 
also that our publicly admitted attitudes may differ considerably 
from our deeper and more private feelings. A man’s pro- OF 
anti-semitism, for example, is exceedingly complex, and co 
not be assessed satisfactorily merely by asking him if he liked 
Jews, nor only by observing such actions as his willingness tO 
patronize Jewish shops. It would be desirable to sample both 
representative acts and opinions. Actually the majority pi 
attitude tests sample verbal expressions only, and the extent 


1Cf. Allport, G. W., ‘ Attitudes’. A Handbook of Social Psychology 
(edit. C. Murchison). Worcester, Mass. : Clark University Press, 1935. 
* Katz, D., and Allport, F. H., Students’ Attitudes. Syracuse, N-¥+ 
Craftsman Press, 1931. 
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to which they penetrate to the ‘ private’ level is doubtful. 
But they are none the less useful provided that these limitations 
are realized, and that they are not assumed to be predictive of 
behaviour without further evidence. And we shall see that 
quite an amount of evidence has been collected showing that 
they do correlate with behaviour, at least to a moderate 
extent. 

The notion of broad sampling is important in attitude 
measurement, both because any single verbal statement may 
give an inaccurate index of a person’s more general attitude, 
and because of the need to avoid stereotyped value judg- 
ments. If a man is asked straight out how religious he is, 
how tolerant to foreigners, etc., his answer will inevitably 
be biased by what he regards as socially respectable. Thus 
it is better to break down the attitude into a number of 
more concrete manifestations, and ask what he did in particular 
situations, or what he thinks about specific points (just as 
in analytic rating scales). For example, how often does 
he go to what church services, what religious books has he 
read recently, does he accept such-and-such beliefs? It 
does not matter if some of the items seem only doubtfully 
relevant to, or partially dependent on, the general attitude., 
Statistical analysis will show whether they are too remote to 
be included. 

Whatever kind of test is adopted, the conception of attitude 
measurement necessarily involves a unidimensional variable, 
that is a definite object or issue towards which some people are 
more favourable than others. This requirement has given rise 
to considerable criticism ; it is said that peoples’ attitudes on 
any topic show an infinite variety of qualitative differences, and 
that they cannot be arranged along a single scale without 
distortion. For example, liberals do not merely hold views 
intermediate between those. of socialists and conservatives. 
But the answer to this is that, if other important dimensions 
exist, they should be measured separately. Thus if liberals 
do differ both from socialists and conservatives in favouring 
greater freedom for the individual (or any other doctrine), 
then an appropriate test should be devised. As in the wider 
sphere of personality, factor analysis is of the greatest assistance 
in showing what attitudes are sufficiently unidimensional to be 
suitable for measurement, which are distinctive and which 
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overlap so much that they are better combined.1 On many 
issues, shades of opinion are indeed too varied and unpolarized 
to be readily measurable. For example, in one research in 
London on the value of school visits (to museums, factories, 
etc.) it was not found possible to produce a satisfactory test of 
children’s attitudes. They almost all liked visits, and though 
many recognized various drawbacks, there was no single clear 
continuum from pro- to anti-. This is very apt to occur among 
children ; adults’ attitudes are generally more crystallized. The 
most pervasive factors running through the political, social, and 
religious opinions of adults are two bipolar tendencies which 
contrast: (a) progressive or radical with conservative, and 
(b) authoritarian or ‘ tough-minded’ with tolerant attitudes. 
But numerous sub-factors have been described in the literature. 
For example, Cattell claims that most attitudes can be grouped 
under McDougall’s list of instincts. 


PUBLISHED ATTITUDE TESTS 


Many readers may be more concerned with techniques of 
attitude test construction, which they can apply to their own 
social, educational, or other researches, than with published 
tests. However, a few of the better known tests will be 
described or listed for illustration. 

Thurstone’s Scales.2 Thurstone and his collaborators have 
produced a whole series of scales of attitudes to the Church, 
war, negroes, communism, capital punishment, ete. There arè 
many other scales similar in form: Peterson’s Attitude to War; 
Remmers’s generalized scales for measuring attitudes to any, 
school subject, or any social institution, ete. (The latter are © 


1 Cf. McNemar, Bibliography ; Cattell, R. B., Description and Measure” 
ment of Personality. London: Harrap, 1946. Vernon, P.E., ‘A Study 
of War Attitudes’. Brit. J. Med. Psychol., 1942, 19, 271-291. Eysenck 
H. J., ‘Social Attitudes’. Current Trends in British Psychology (edit- 
C. A. Mace and P. E. Vernon). London: Methuen, 1953. 

2 Thurstone, L. L., et. al., Scales for the Measurement of Social Attitudes- 
Chicago : University of Chicago Press, 1930. Also, Thurstone and ChaY& 
Bibliography. 4 

® Peterson, R. C., Scales for Attitude Toward War. Chicago: University 
of Chicago Press, 1930. Remmers, H. H., et. al., Attitude Scales. Lafayette 
Ind.: Purdue University, Division of Educational Reference, 1 ‘ag 
Remmers, H. H., and Silance, E. B., ‘ Generalized Attitude Scales * 
J. Soc. Psychol., 1934, 5, 298-312. 
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very dubious value.) In Britain, Jordan ! has published a scale 
for children’s attitude to French, and numerous other scales 
on various educational and social questions are contained in 
student M.A. and Ph.D. theses.? Each of these consists of 
about twenty statements graded on a 1 to 11 scale from 
highly favourable to highly unfavourable, but printed in 
random order. 
The following examples are from Jordan’s scale : 


I think that it takes so long to learn a foreign language 
that the attempt is not worth while (2-0) ; 

I like to listen to French talks on the wireless, because I think 
it will improve my knowledge of the language (9:5); 

I only borrow French books occasionally from the school 
or public library (7-6). 


The testee ticks the statements with which he agrees and 
the median or mean scale value of these gives his attitude 
score. F 

Multiple-choice Tests of Radicalism-Conservatism and Other 
Attitudes. Lentz’s 3 C-R Opinionaire contains fifty statements 


such as: 


The metric system of weights and measures should be 
adopted instead of our present system. _ : 
Even in an ideal world there should be protective tariffs. 


Conscience is an infallible guide. ; 1% 
Armistice Day should be celebrated with less martial spirit. 


Each of these is answered -+ for agreement or — for disagree- 
ment. They are not graded as in Thurstone-type scales ; 
instead the testee’s score consists of the number of state- 
ments he answers in the conservative direction (i.e. No in 
the first and fourth examples, Yes in the second and third). 
Alternatively the multiple-choice or ‘ cafetaria ’ question may 


1 Jordan, D., ‘ The Attitude of Central School Pupils to Certain School 
Subjects, and the Correlation between Attitude and Attainment’. Brit. 


J. Educ. Psychol., 1941, 11, 28-44. 
2Cf Blackwell, A. M., ‘A List of Researches in Education and Educational 


Psychology. London : Newnes, 1950. r ? 
3 Lentz, T. F., et. al., C-R Opinionaire. St. Louis, Mo.: Washington 


University, Character Research Institute, 1935. 
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be used, as in this example, abbreviated from Vetter’s* 
questionnaire : 


What are your views on HEREDITARY WEALTH ? 

1. All wealth should revert to the State at death. 

2. Taxes should confiscate the bulk, leaving only enough 
for support of dependents. 

8. Inheritances should be taxed ona rapidly graded scale, 
up to about 50% for large fortunes. p 

4. Very large fortunes should pay a reasonable inheritance 
tax, but not so high as to become confiscatory- a 

5. Individual thrift and initiative should not be dampe 
by any inheritance taxation. 


Here the score is based on the grade of responses to several such 
questions. : 

Likert ? has published tests of Internationalism, Imperialism, 
and Attitude to Negroes, and all tests with multiple-choice 
answers are sometimes referred to as Likert-type, as contraste 
with Thurstone-type. In point of fact, Likert did not origina 2 
a new type of question, but only a technique of weighting E 
graded responses to any question according to their centroids OF 
z-values. This technique has not been generally adopt 
because the simpler method of weighting the responses 5, 4, 9s 
2, 1 or +1, 0, —1, ete. gives practically the same results. 

Eysenck’s 3 test of anti-semitism combines the Thurstone a 
multiple-choice types. Thus the following two statements par 
graded responses : Strongly Agree, Agree, Undecided, Disagre® 
Strongly Disagree. 


and 


The Jews have too much power and influence in this 
country (6-9). ny 

The Jews have survived persecution because of the ma 
admirable qualities they show (1:0). 


. 3 d 
The first is much more strongly anti-semitic than the seeti 4 
and possesses a higher Thurstone scale-value. This is allo 
, nd 
1 Vetter, G. B., ‘ The Measurement of Social and Political Attitudes. 180. 
the Related Personality Factors’. J. Abn. Soc. Psychol., 1980, 25, 14 Arch. 
* Likert, R., ‘A Technique for the Measurement of Attitudes °. 
Psychol., 1982, 22, No. 140. es inion- 
? Eysenck, H. J., and Crown, S., ‘ An Experimental Study in Opia 
Attitude Methodology’. Int. J. Opinion Attitude Res., 1949, 3, 4T- 
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for by weighting the five responses to the two statements: 
2, 8, 4, 5, 6, and 8, 6, 4, 2, 0 respectively (a high score repre- 
sents pro-semitism). 

Other tests deserving special mention include Neumann, 
Kulp and Davidson’s Test of Internationalism, Hartshorne and 
May’s ethical attitude scales, Rundquist and Sletto’s studies of 
the morale of unemployed men, and the extensive series of 
scales applied in the American Army by Stouffer and others.! 
An interesting scale recently constructed by Shoben ? contains 
eighty-five items representing domineering, possessive, rejective, 
or other undesirable parental attitudes regarding child-upbring- 
ing, which have been proved to differentiate mothers of problem 
children from mothers of normal children, e.g. : 


A child should be seen and not heard. Strongly Agree, 
Mildly Agree, Mildly Disagree, Strongly Disagree. 

Parents cannot help it if their children are naughty. 

A parent should see to it that his child plays only with the 
right kind of child. 


ATTITUDE TEST CONSTRUCTION: 
CHOICE AND WORDING OF STATEMENTS 


No amount of statistical treatment will compensate for 
Poverty in the initial choice of statements. The scope oF 
Content of the attitude should be defined in detail, and as many 
stiterences as possible tabulated between people thought to Pe 
dines or weak, pro- or anti-. Preference should be given to 

ferences in behaviour, although, as our illustrations show, 
m any tests are based almost entirely on beliefs and judgments. 
hurstone advocates collecting statements about the issue from 
Conversations with people, student essays, newspaper editorials, 


tc., in order to avoid the narrowness or lack of vere s 
items thought out solely by the author. If the test is intende! 
Test of International 


a Neumann d Davidson, H., 
Altitudes, New eh ee, College, Bureau of ne 
artshorne and May, Bibliography. Rundquist, E. A., an tease a 
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for school pupils, samples of their opinions in their own words 


should be collected. Each statement or question must also be 
formulated in such a way as to seem natural and credible to the 
kind of people for whom the test is intended, so that it may be 
readily acceptable to, or rejectable by,them. Ina multiple-choice 
test, it must be conceivable that at least a few people will agree 
with the most pro-, and others with the most anti-, answers. 

Wang ! gives a useful list of rules for wording statements or 
questions. They should be short, simple, and unambiguous. 
The following is bad, since it could be taken as representing 
support for, or opposition to, birth control : 


Birth control legislation is a disgrace to our civilization. 


Double-barrelled statements are always ineffective, since some 
testees may pay attention to one clause, others to the other. 
As in the case of personality questionnaires, some of the more 
intelligent and critical testees are likely to object to ee 
attitudes being straight-jacketed by the form of the item, an! 
will want to qualify every response. This can be reduced by 
careful wording, and by allowing additional space at the bottom 
of the test blank for spontaneous comments, in which they ca? 
let off steam ; but best of all by having a trial run. That E 
the test should be discussed freely with a small group of typica 
testees and suggestions welcomed for reducing ambiguities. 
That the distortion or simplification of shades of opinion 1D 
attitude measurement is less serious than many critics Mer 
is shown by an experiment by Stouffer.?2 He obtained from 2 5 
1 Wang, C. K. A., ‘ Suggested Criteria for Writing Attitude Statements - 
J. Soc. Psychol., 1932, 3, 367-373. ae da 
* Stouffer, S. A., ‘ Experimental Comparison of the Statistical WE 
Case History Technique of Attitude Research’, Publ. Amer. Sociol. way 
1981, 25, 154-156. This experiment could be interpreted the other A 
round; i.e. it shows that spontaneous expressions of attitude Ga by 
used to give a reasonably reliable index of the attitude, when a cof 
experienced and impartial judges. James and Tenen raise a num! d to 
legitimate criticisms of questionnaires, particularly when applie they 
children, and advocate assessing attitudes by free interview. But Rice 
give no indication as to how the type of interviewer bias to which ma 
drew attention (cf. p. 21) is to be controlled. James, H. E. Oir 950, 
Tenen, C., ‘ How Adolescents Think of People’. Brit. J. Psychol., en 
41, 145-172. Further evidence on the unreliability of attitude Bee p 
even by skilled interviewers is given by Wedell, C., and Smith, B- 


i. 
“Consistency of Interview Methods in Appraisal of Attitudes’. J- APP 
Psychol., 1951, 35, 392-896. 
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students anonymous accounts of their own experiences and 
opinions about alcohol and prohibition. These completely 
unforced expressions of attitude were rated by four independent 
judges for their favourableness or unfavourableness to pro- 
hibition, their average agreement being shown by a correlation 
of -86. The combined ratings correlated -81 with the scores of 
the same students on an ordinary test of Attitude to 
Prohibition. 

By the ‘ acceptability ’ of items is meant the proportion of a 
typical group of testees who agree with the item, or who check 
each response to it. This acceptability must be estimated at 
least roughly in advance, since the levels vary markedly in 
different types of test. In a Thurstone-type test, statements 
must range from very high to very low, or strong pro- to strong 
anti-, and include a good proportion of moderate or neutral 
shades. In a test like Vetter’s the statement may be non- 
committal or neutral, while the responses should range down to 
about 10% acceptability in either direction. But in most 
multiple-choice tests (Lentz’s, Likert’s, Eysenck’s, etc.) the 
statements are around 85 to 65% and 35 to 15%. For if very 
extreme items are included, almost all the responses will be No 
or Disagree ; while if they are too middling, they cannot reveal 
any definite variations in attitude. 

Rundquist and Sletto 1 have demonstrated clearly that, in a 
test which asks for socially approved or disapproved opinions, 
questions which state the disapproved view are not always 
answered in the same way as apparently identical questions 
Stating the approved view. For instance, if 40% of a group 


accepts as True: 
The Government’s policy is subservient to big business 
interests, 
it will not usually be found that all the same testees, or even 
the same proportion, will answer No or False to: 
The Government’s policy is independent of big business 
interests. 
Much as in personality questionnaires, the unpopular question 
or item seems to arouse more emotion and to be answered less 
1 Op cit., p. 149. 
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rationally than the same item stated in the reverse way. These 
authors recommend including both types, since they touch off 
different aspects of the attitude. ; 
Eysenck and Crown ! carry the argument further by pointing 
out that acceptability depends not only on positive or negative 
strength, but also on social stereotypes. For example, these 
two statements have almost the same Thurstone scale-values : 


The Jews have too much power and influence in this 
country. 
Jews lack physical courage. 


But the former was agreed to by 88% of a group of 250 normal 
adults, the latter by only 24%. For there is a common stereo- 
type that Jews are mercenary, but not that they are cowardly. 
One would deduce from these findings that it is desirable to 
include a large number of varied items in an attitude test in 
order to cancel out the effects of such specific stereotypes. 


ITEM ANALYSIS AND SCALING 


In constructing a test one should start with at least twice 
as many items as one is likely to require for the final form. The 
various techniques for picking the best of these, and ensuring 
their relevance to a unidimensional attitude, were listed briefly 
on p. 123. The simplest is to ask a group of judges to assess 
each item, and to eliminate those which are criticized by ® 
substantial proportion. Judges can help also in defining the 
scope of a complex attitude; for example, the consensus © 
their opinions can be taken regarding the proportions © 
political, religious, sociological, and other items to include in 2 
test for general radicalism. Thurstone’s ? scaling technique is 
an extension of this, whose main object is to place each item 
on a scale of equivalent units, according to its favourablenes® 
or unfavourableness, It is based on the psychophysical metho 
of equal-appearing intervals. Thurstone usually employ’ 
several hundred judges, though reasonably accurate results cat 
be obtained with as few as twenty-five. Each is given a S© 
of items, typed on separate slips, and asked to sort them into 
eleven piles, the left-most pile, No. 1, containing items SEE 
sidered most favourable, and the right-most pile, No. 11, t 

1 Op cit., p. 148. 2 Cf. Thurstone and Chave, Bibliography- 
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most unfavourable. The other piles are intermediate and 
should be approximately equally spaced in degree of favourable- 
ness, in each judge’s opinion. It is easier, perhaps, to have 
nine piles; then the judges can first sort into three large piles 
and later sub-divide each one. The numbers of items assigned 
to each pile can be uneven, but no one pile should contain as 
many as a quarter of the total. 

A cumulative frequency graph is then plotted for each item 
(cf. Fig. 5). Say there are 40 judges: 1 puts an item in pile 1, 


Nos. of 
Judges 40 


30--------- 777 


20L------ -- 


1 2 K ie 5 6 7 8 9 Pile No. 


Fig. 5.—The Scaling of an Attitude Statement. 
2 more making 3 in pile 2; 8 calls it 8 or higher, and so on. ‚No 
one puts it lower than No. 6. The median and quartile positions 
are then read off to the first decimal place.’ ‘Thus the position 
opposite the twentieth judge is 4-2, and this is taken as the 
scale value of the item. The quartiles fall at 4-9 and 3:3, and 
the difference between them, namely 1:6, gives Q—a measure 
of the agreement between the judges. If their opinions of its 
i j lace it in 

1 A scale value will be less than 1 if more than 50% of judges p 

the first pile. Its median position is then Tund ek ae 
i by more than 
Mare GAGE Che Ne ker ar pisces ae falls outside the graph, in which 


More often one or other of the quartiles he g 
case Q is read off from the median and the other quartile instead of from 


both quartiles. 
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scale value vary widely, it is obviously unsatisfactory and 
should be eliminated. On a 9-point scale, Q should seldom 
exceed 2:0, and an average of less than 1-5 should be 
aimed at. 

To make up the final scale, about twenty to thirty statements 
are selected which are: (a) fairly evenly spaced throughout 
the scale. Thus there might be three items having each value 
1+,2+...8+; (b) lowing; (c) heterogeneous in content. 
The tester must still use his subjective judgment to ensure that 
the scale contains a variety of expressions of attitude. Thurstone 
does not mention this point, but he describes an ‘ index of 
similarity? or © irrelevance’, which would tend to reduce 
heterogeneity. It is based on the responses to the statements 
of a large group of testees, and resembles a correlation between 
each statement and every other. Actually, few attitude testers 
have ever adopted this. 

It might be thought that the judges’ own attitudes would 
affect the results of their sealing, but experiments prove that 
this is not so. Possibly it is unwise to apply scales sorted by 
adults to children, since the meanings they read into the items 
may differ. The Thurstone method is laborious, but it does 
yield scales which ‘ go down’ well with most subjects. The 
fact that they need check only a few statements, instead © 
having to give a Yes, No, or graded response to every item, 18 
an advantage. A major defect in many scales is that the scores 
are insufficiently reliable. This may be due to vagueness A 
multidimensionality in the attitude itself, or to the sma: 
number of items that subjects check. It is desirable, therefore 
to try out a new scale on a group of one or two hundred peop!® 
and to calculate their scores separately on the odd-numberé™ 
and even-numbered items. These two scores should correla Pr 
to at least -74 if the scale as a whole is to have a corrected spli h 
half reliability of -85. Note that Thurstone scale units, althouge 
equal-appearing, are not absolute. It is not legitimate 
compare an individual’s score on two or more scales TE 
via percentiles); nor can any one score, say 5, be designated F 
a neutral attitude. The Guttman type of scale is the gpr 
successful in establishing the point where favourable chang 
over to unfavourable.! 


Pa) 
*Cf. Guttman, L., ‘On Festinger’s Evaluation of Scale Analysis 
Psychol. Bull., 1947, 44, 451465. 
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MULTIPLE-CHOICE SCALES 


A separate group of external judges is not needed for multiple- 
choice scales, unless they be of the type proposed by Eysenck. 
Instead, item selection is based on the responses of a group of 
200 or so testees to a preliminary draft ofthetest. The simplest 
technique is to tabulate separately the responses of subjects 
with the highest, middle, and lowest thirds of scores on the 
test as a whole. By combining these, the distribution of 
responses for each item in the total group is obtained. As 
already mentioned, preference should be given to items whose 
distribution is not too bunched. Thus with 5-response items : 
2 8 4 5 
27 ae: 

i Ww 


Response 1 

Percent Frequency: 18 42 

Is better than: 37 55 2 

By contrasting the upper and lower groups, the most consistent 
items can be found, e.g. : 

Response 1 2 3 4 5 

Percent Freq. in Top Third: 80 45 21 4 0 

Percent Freq. in Bottom Third: 15 28 89 15 8 

ance of the difference in mean response 


ere the means are 1:99 and 2:68, and if 
the Critical Ratio of 4:1 is 


The statistical signific 
can be determined. H 
there are, say, 70 in each group, 
highly significant. Alternately, a split may be taken at the 
response which cuts off as nearly as possible half the combined 
groups—in this case between Responses 2 and 8, as this cuts off 
118/200. Then the difference between the two percentages 
above the split, 75—43=82, would be regarded as sufficiently 
large. A percentage difference of less than 15 would usually 
indicate too little agreement with the test as a whole.? 

The group can be rescored on the selected items, or, better, 
the revised test given to a fresh group, and the split-half 
reliability determined as before. With about a dozen items this 
will usually be quite high ; and if not, the necessary number of 
items can readily be calculated. Eysenck claims that his mixed 
cally favours items of middling 
dices of Item Consistency and 
948, 1, 152-166. 


1Note that this technique automati 
acceptability. Cf. Vernon, P. E., “In 
Validity’. Brit. J. Psychol. Statist. Sec., 1 
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type gives even better coefficients. Here Thurstone sorting 
and item analysis are applied. The responses to items with 
different scale values may be scored according to the following 
table, adapted from Eysenck and Crown’s article. 


Scale Responses 
Value ++ 


8-34 
TIL 
714 
65+ 
59-1 
584 
47 4 
414 
8-54 
2-94 
234 
17+ 
1.64 


„Factor analysis can be applied to test items, as well as to 
different attitude tests, in order to ensure reasonable reliability 
and unidimensionality +; but this is seldom done because of the 
vast number of correlations when there are many items to 
analyse. Stouffer 2 and his associates advance strong arguments 
for Guttman’s scalogram analysis technique. This is especially 
useful for constructing very short unidimensional and reliable 
scales. But we will not describe it, since it seems to offer little 
practical advantage to compensate for its laboriousness. One 
of its weaknesses is that the initial choice of suitable items for 
scaling depends entirely on the test-constructor’s skill. Edwards 
and Kilpatrick 3 therefore advocate, first choosing a suita ple 
range of items by Thurstone sorting, then pruning by ordinary 


item-analysis, and making a final selection by scalogra™ 
technique. 


CHEN WAAAaAataIo 
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‘Cf. Eysenck, H. J., ‘ Primary Social Attitudes : I. The Organization 


and Measurement of Social Attitudes ’, Int. J. Opinion Attitude Res. 
1947, 1, 49-84, 
2 


Op cit., p. 149, Con- 
® Edwards, A. L., and Kilpatrick, F. P., ‘A Technique for the 
struction of Attitude Seales’. J, Appl. Psychol., 1948, 32, 374-384. 
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In duplicating or printing opinionaire or Thurstone-type 
tests, pro- and anti- statements should be arranged in random 
order. In multiple-choice tests, the most pro- response is 
sometimes put first, sometimes last. This is desirable in 
order to prevent testees checking Yes (or Strongly Agree, 
No, ete.) throughout without considering each item carefully, 
also so as to reduce ‘space errors’. (In any multiple-choice 
test there is a tendency to tick the topmost or left-most 
response rather than responses printed on the right, or at the 
bottom, of a set.) 


DISCUSSION 


_ It is probably a mistake to aim at extremely high reliability 
in attitude scales, since this may be obtained at the expense 
of validity.! It will usually arise when the items are too 
omogeneous in content, or very numerous. Under these 
circumstances the testee can hardly fail to realize that they 
all refer to his own radicalism, or other attitude, and he will 
be more likely to answer each item according to his stereotype, 
or according to what he thinks is socially approved. es 
_ The attitude of the subjects to the test and tester is just as 
important as it is with personality questionnaires, though 
distortion of responses is likely to be less serious because the 
subject-matter of most attitude tests is less disturbing. One 
would not expect to get results of any value from school- 
children until they reach Mental and Educational Ages of at 


least 12 years. But it is certainly possible to construct scales 


that get across to adults of lowish ability, witness the extensive 
des of American recruits 


practical use made of them with all gra 


during the war. 
No one appears to have tried to validate attitude tests by the 


trait composite method. Such evidence as correlations with 
associates’ ratings or with indices of behaviour is useful, but 
inconclusive, since these criteria are imperfect for the reasons 
already given. As mentioned in Chap. VI, there is little correla- 
tion between ethical attitude tests and honest conduct. How- 
ever, many tests have been shown to differentiate significantly 


1Cf. Kirkpatrick, C., ‘ Assumptions and Methods in Attitude Measure- 
ments ’. dice Wena 1036, 1, 75-88. See also McNemar’s detailed 
analysis of the weaknesses of different types of scales. 
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(albeit with a good deal of overlapping) between groups which 
would be expected to possess contrasting attitudes. For 
example, Eysenck finds good discrimination on his radicalism 
scale between people who vote Conservative, Liberal, and 
Labour. On Thurstone’s Church scale the average scores of 
Roman Catholic students was 2-90, of Jewish students, 5-44. 
Watson’s test of fair-mindedness (cf. below) shows social 
psychology students in an eastern American university to be 
less prejudiced than middle-west parsons. Likert finds large 
differences in attitudes to negroes between southern and north- 
eastern colleges. Adorno and Frenkel-Brunswick, also Allport 
and Kramer! have traced out extremely interesting and 
psychologically plausible, differences between subjects. with 
high and low scores on tests of racial and religious prejudice 
(ethnocentrism). The consistent way in which strong prejudice 
connects with factors of upbringing and with the personality 
as a whole is very convincing. Many scales have been use 
successfully in investigations of the effects of propaganda or 
of various types of instruction.2 


STUDIES OF GROUP ATTITUDES 


Actually it is hardly hecessary, in studying differenes® 
between attitudes of groups to use tests which yield reliable 
individual scores ; less elaborate techniques are often adequate. 
An enormous amount of work is done in social and applic, 
psychology on group attitudes which falls outside the scope pa 
the psychology of personality. Examples are children’s interes t 
in different school subjects or leisure pursuits at diferea 
ages, employees’ views on sources of satisfaction and dissa 
faction in their work, the innumerable surveys conducted a 
Gallup polls, Mass Observation, and like organizations, not a) 
speak of newspaper competitions which determine their na 
most popular film-star or novel, and voting in Parliamenta y 


1 Adorno, T. W., and Frenkel-Brunswick, E., et. al., The Authoritarie® 
Personality. New York: Harper, 1950. Allport, G. W., and Krai 
B. M., ‘Some Roots of Prejudice’. J. Psychol., 1946, 22, 9-89. kins 
SCE Lichtenstein, A., “Can Attitudes be Taught?’ Johns Hop. 
Univ. Stud. Educ., 1934, No. 21. ities DY 
®Cf. Vernon, P. E., ‘ The Assessment of Psychological EA 
Verbal Methods’. Industr. Hlth. Res. Board Rep., No. 83. Lon 
H.M. Stat. Office, 1938. 
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elections. These normally rely on one or two questions 
only as indices of majority attitudes, and compensate for the 
courequent loss of reliability by the large number of voters. 
Nevertheless, the validity or practical significance of the results 
is often unsatisfactory, partly because of the difficulties of 
ee really representative samples, and partly because of 

oubts as to the proper interpretation of responses. Unless the 
en are extremely straightforward, the respondents may 
ae Ka S meanings into them from those expected by the 

a en Eysenck points out that the proportion of a 
z pulation favouring a given issue may appear to range almost 

nywhere from about 10% to 90%, depending on the kind of 
duearion asked and the extent to which it touches off common 
fee (ef. p. 152). Cantril ? provides a useful summary of 
int arge body of research into wording of questions, effects of 
Bee ade bias, sampling and other errors in opinion-polling ; 
MeN cNemar gives a detailed critique of polling methods. 
don pet concludes that the results of such polls would be 
ee erably more accurate if more use was made of properly 
ee a tucted ‘scales and tests, though these could be shorter 

nd simpler than those required for measuring individual 


attitudes, 


INDIRECT TESTS OF ATTITUDES 


d that the questions or statements 
be too direct. A number of tests 
uised have been suggested.® 
ulties of validation 
practical purpose. 


. We have already suggeste 
In attitude tests should not 
ern are still more carefully disg 
ee it is because of the obvious diffic 
Ri they have seldom been used for any 

- B. Watson’s 4 early test of fairmindedness is an ingenious 


One Testees are told that it is a ‘Survey of Public 
pinion’, but their prejudice or fairmindedness in political, 
1 An amusing example of this is quoted by McNemar. In one American 
Survey a remarkable proportion of negroes were found to be opposed to 
Ba ets control of profits. Closer questioning showed that they 
red that God alone should control prophets. 

Cantril, H., et. al., Gauging Public Opinion. Princeton, N.J.: Prince- 

ton University Press, 1944. 
= useful survey is provided by Campbell, see Bibliography. 

Watson, G. B., ‘The Measurement of Fair-mindedness’. Teachers 

College Contr. Educ., 1925, No. 176. 
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social, and religious matters is brought out in six varied sub- 
tests. One of these contains such statements as : 


All, Most, Many, Few, No Roman Catholics are super- 
stitious. 


An ‘All’ or a ‘None’ answer is taken to show repen 
prejudice ; the less extreme answers are accepted as a sgn 
tolerance. In another sub-test a statement is presented : 


In the United States 3% of the people own 60% of the 
wealth. 


This is followed by several conclusions that might be drawn, 
including the following : 


The great incomes should be more heavily taxed. is 

Such a concentration of capital is inevitable if industry Í 
to be effectively developed. 

No conclusion stated here can fairly be drawn. 


The testee who checks the first of these as a legitimate niad i 
gets a mark for socialistic prejudice ; the second—capitalisti 5 
Only if he checks the last does he obtain a mark for en 
mindedness. Other tests which measure the influence of ve 
on logical reasoning are Morgan’s, and the Watson-Glest 
Tests of Critical Thinking. “Watson's use of extreme ees 
moderate opinions is amplified in Thouless’s 2 study of per 
of certainty in religious beliefs. He shows that the distribu oat 
of responses to multiple-choice questions on religious top 
which are highly charged with prejudice, tend to be bimo ic, 
since so few individuals take up moderate views. He ie EE 
too, that this might provide a measure of irrational thin oat 
As in the case of Personality questionnaires, however, oe 
difficult to prove that this “response set’, or extreme 
tendency, is psychologically meaningful. 


ic 
* Morgan, J. J. B., and Morton, J. T., ‘The Distortion of Syllogity 0, 
Reasoning Produced by Personal Convictions’. J. Soc. Peychok, 1 Critical 
89-59. Glaser, E. M., ‘An Experiment in the Development 0: 
Thinking’. Teachers College Contr. Educ., 1941, No. 848. Sen Belief” 
* Thouless, R. H., ‘The Tendency to Certainty in Religious 
Brit. J. Psychol., 1985, 26, 16-81. 
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Distortions of perception and memory have also been em- 
ployed, e.g. by Horowitz, and Cattell.! Thus an Aussage test 
(cf. p. 88) may be given with suggestive questions that allow 
scope for colour prejudice to enter. Thematic apperception 
(ef. P- 181), doll play, and some other projective techniques have 
similarly been applied with material liable to touch off various 
attitudes. These do not yield any quantitative scores, but 
assessments of the subjects’ responses correlate fairly closely 
with attitude scale scores. Perhaps the most promising of recent 
tests are Hammond’s,? which take the form of information 
tests. A typical item is : 

The average weekly wage of workers in the U.S. in 1948 
was: (a) $40, (b) $60. 


The true answer happens to be $50. Hence the choice of (a) 
indicates socialist, (b) capitalist, attitudes. Finally Travers 8 
and others have shown that a person’s estimates of the propor- 
tion of people who hold a given attitude correlates positively 
with his self-expressed attitude. For example, the radical is 
much more likely than the conservative to overestimate the 


Proportions of radicals in the population. 


INTERESTS 


Interests are very much the same as attitudes, though their 
definition is also a matter of controversy.* Their subject- 
matter is usually more concrete. We are interested in or like 
athletics, music, model aeroplanes, etc., whereas we have 
favourable or other attitudes to religion, foreigners, ete. But 
an interest is just as complex as amalgam of subjective feelings 
and objective behaviour tendencies, and interests are at least 
as manifold and as difficult to reduce to a few unidimensional 
variables. The best-known test, Strong’s Vocational Interest 
ward the Negro ’. 
‘ The Objective 
1-90. 
r-Choice: An 


1 Horowitz, E. L., ‘ The Development of Attitude To 
Arch. Psychol., 1936, 28, No. 194. Cattell, R. B., el. al., 
Measurement of Attitudes’. Brit. J. Psychol., 1949, 40, 8 

*Hammond, K. R., ‘ Measuring Attitudes by Erro: 
Indirect Method’. J. Abn. Soc. Psychol., 1948, 48, 3848. s 
i 3 Travers, R. M. W., ‘A Study in Judging the Opinions of Groups’. 

Tch. Psychol. 1, 37, No. 266. b k f 

“CE. en: E.“ Tatarest » as a Psychological Concept’. Brit. 
J. Psychol., 1949,{39, 184-195. 
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Blank, solves the problem of what to measure by taking 
different occupations as the criterion. Obviously the number 
of these is limitless, and many occupational interests overlap, 
positively or negatively. Other testers have therefore adopted 
@ priori classifications of a limited number of more general 
types of interest. Factorial analysis is beginning to provide a 
more objective answer. 

Allport and Vernon’s Study of Values is based on the classifica- 
tion proposed by Spranger in his “Types of Men’? It is 
designed to test an individual’s relative standing on six main 
types of value or general interest : theoretical or scientific, 
economic or utilitarian, aesthetic or artistic, social or humanl- 
tarian, political or power-seeking, and religious or spiritual. It 
employs the forced-choice technique. The testee is told to rank 
his order of preference to the four answers to such questions as : 


If you could influence the educational policies of the public 
schools of some city, would you undertake : 


(a) to promote the study and performance of drama į 

(b) to develop co-operativeness and the spirit © 
service ; 

(c) to provide additional laboratory facilities ; r 

(d) o Dame school savings banks for education in 
thrift. 


If he puts (c) top he scores 3 for theoretical values, and if he 
puts (a) bottom he scores 0 for aesthetic values, and so on. py 
summing the answers to all the questions a range of mar 
from 0 to 60 is possible on each value. Note that, as in an 
attitude test, the questions are rather indirect expressions © 
the values, couched in terms of behaviour. Testees are ai 
told the object of the test since one does not wish to get merey 
their own impressions of how religious, artistic, etc., they 47° 
The items have been analysed in the manner described aboY® 
and shown to yield reasonably consistent scores. 


, 
*Cf. Vernon, P. E., ‘ Classifying High-Grade Occupational Interests 
J. Abn. Soc. Psychol., 1949, 44, 85-96. Rev: 

* Allport, G, W., Vernon, P. E., and Lindzey, G., Study of Values ( h 
ed.). Boston : Houghton Mifflin, 1951. Vernon, P. E., and Allport, ae) 
‘A Test for Personal Values, J. Abn. Soc. Psychol., 1931, 26, 281- 
Spranger, E., Types of Men. Halle : Niemeyer, 1928. 
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‚The Kuder Preference Record + is similar, but aims to measure 
nine more specifically occupational interests, at high school, 
college, and adult level: Mechanical, Computational, Scientific, 
Persuasive (salesmanship, etc.), Artistic, Literary, Musical, 
Social Science, and Clerical. (Other editions include Outdoor, 
Sociable, Practical, Theoretical, Smooth Personal Relations, 
and Dominant interests, together with a verification or check 
score.) It is widely employed by American vocational guidance 
centres, and is noteworthy for its ingenious rapid-scoring 
device. The testee pricks his preferences through a series 
of sheets ; and when these are unfolded the numbers of pricks 
in printed circles, corresponding to each interest type, are 
counted up. 
_ Strong’s Vocational Interest Blank2 The occupational 
Interests claimed by a person undergoing vocational guidance 
tend to fluctuate, and are often based on entirely mistaken 
notions of what the occupations entail. Freyd and others, at 
the Carnegie Institute of Technology,® and later Strong at 
Stanford University, hit on the method of recording the testee’s 
immediate likes and dislikes for a large number of miscellaneous 
items, and then deducing his true interests from the total 
pattern of his responses. Strong’s Blank contains some 420 


items, including lists of: 
Occupations, e.g. actor, advertizer ... Y.M.C.A. worker. 
Amusements, e.g. golf, tennis, chess, pet canaries. 
Subjects of study, e.g. algebra, arithmetic . . - zoology. 
Miscellaneous activities, e.g. repairing a clock, arguments, 


saving money. T ? 
Types of people, eg. optimists, pessimists, foreigners, 

cripples, socialists. 
ses! Henry Ford, ete., ete. 


Famous people, e.g. Caruso, Edison, 


Each item is followed by L I D (like, 
other simple responses, one of which is t ‘ 
no time limit, but rapid answering is advised, a 


rd. Chicago: Science Research 


indifferent, dislike), or 
o be checked. There is 
nd the test 


1Kuder, G. F., Kuder Preference Reco 
Associates, 1942. 

2Strong, E. K., Vocational Interest Blank. Stanford, Cal.: Stanford 
University Press, 1927. Vocational ‘Interests of Men and Women. Stanford, 
Cal. : Stanford University Press, 1948. ý 

3 Cf. Fryer, Bibliography. 
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should not take more than about 35 minutes. The scoring is 
purely empirical, being derived from the actual responses of 
very large groups of individuals concerned in particular 
occupations (artists, architeets, doctors, farmers, real estate 
salesmen, psychologists, etc.). Several techniques of con- 
structing scoring keys have been suggested. Strong finds the 
following simple one effective. Take as an example the item : 
Actor LID, and its scoring for interest in Personnel Manage- 
ment. The percentages of a group of personnel managers, and 
of members of other vocations in general, who check the 
responses are: 


L I D 
Personnel Managers 49 88 13 
All others 38 85 27 


Difference +11 +3 —14 


That is, personnel managers are slightly more apt than most to 
say that they like this occupation. The differences are trans- 
posed into somewhat smaller figures, and the final marks for 
the item are +2, +1, and —3 respectively. Similarly the scores 
are determined for each of the 1200 or so possible responses 
for each occupational group. An individual’s score for an 
interest is the sum of his + and — marks, which thus indicate 
the resemblance of his pattern of interests to the typical 
patterns of personnel managers, or artists, etc. Scoring & 
single blank by hand against forty occupations takes several 
hours ; it is therefore usually done by machine, at considerable 
expense. If the testee’s score falls within the range of scores 
given by 75% of the personnel managers (or other) group, he 15 
given an A-rating for that occupation ; if it is within the range 
of the lowest 25% he is given a B-rating ; if it is lower than the 
score of any of the original personnel managers, a C-rating- 
A candidate for guidance is advised to enter only those occu- 
pations for which he receives an A-rating. 

Keys for some forty-nine different occupations are available i 
Another blank is published for measuring women’s interests 12 
twenty-six occupations, and a new test has: recently bee? 
constructed by Clark 1 for differentiating interests in civilian 
and naval skilled trades. The same technique has been applic 


1 Clark, K. E., ‘ A Vocational Interest Test at the Skilled Trades Level ’» 
J. Appl. Psychol., 1949, 88, 291-303. 
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by Garretson! to measuring the academic, technical, or 
commercial interests of secondary schoolboys. None of these 
tests would be of any use in Britain, unless restandardized, 
Since it is unlikely that the likes and dislikes of Californian 
and British vocational groups would be sufficiently similar. 
Many of the present items, also, might arouse ridicule (e.g. 
Acting as yell-leader, Pursuing bandits in sheriff’s posse, People 
with gold teeth, Men who use perfume). The Study of Values 
and Kuder Preference Record are also too American in phrase- 
ology, Some adaptations of the former have been used here 
experimentally, but are not published. A test of the Strong 
type was constructed for Army recruits during the war, with a 
view to allocating them to one of six main types of employ- 
ment.?2 The mechanical scores were shown to have considerable 
Predictive value in picking men for mechanical jobs, but the 
test never came into general use, both because the scoring was 
too lengthy and because the keys were not very reliable. 
Strong considers it desirable to have standardization groups of 
about 500 in each occupation for which a key is being prepared. 
€ found also that interest tests tend to be less effective among 
average or dull adults than at professional and business levels ; 
education and intelligence have a considerable influence on the 
responses, For example, the Likes checked by a skilled 
Instrument mechanic tend to bear a closer resemblance to those 
checked by an equally intelligent clerk than to those of a 
Ower-grade mechanical worker. None the less, a test covering 
a few main types of high-grade occupational interests would 
be extremely useful in this country, though its preparation and 
Standardization would be a herculean task. . : aan 
Weakness which applies to all the tests in this (as aa 5 
Previous) chapter is that they are extremely open to faking. 
€veral experiments have shown that subjects can raise Se 
wer their scores significantly if they wish to make themselves 
Out as possessing certain interests. Thus, none of these tests is 
very suitable for educational or vocational selection, wiae an 
incentive to fake may operate. But this does not den. SER 
ir value in vocational guidance or other situations where the 
ı E F P ixpressed Preference and 
Comia oe 0. K, ee ames eg sre! College Contr. Educ., 
1980, No. 396, 
à Cf. Vernon and Parry, Bibliography. 
I2 
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subjects realize the desirability of truthfulness. Some psycho- 
logists would criticize the extreme empiricism of a test which 
takes no account of the meaning the questions have for the 
subject. Their doubts may be heightened by an experiment 
carried out by Burnham and Crawford,! who obtained a set o. 
purely chance scores on the Bernreuter Inventory and the 
Strong Blank by throwing dice to decide each response. “oF 
applying the keys it was found that the dice had obtaine 
scores characteristic of a psychoneurotic boy scout master OF 
journalist ! A 
Nevertheless all the interests tests mentioned do work. 
Correlations around -5 are found between Study of Values scores 
and associates’ ratings on these values, which is fairly high 
considering the difficulty of explaining the values to the raters. 
Large differences are obtained in values profiles between student 
groups taking different courses (engineering, arts, theological, 
etc.), and correlations of up to -45 with some of the Strong 
interests,” in spite of the totally different techniques of the 
two tests. Strong has shown that the repeat reliability co“ 
efficients of interest scores in adults are remarkably high; 
approximating -80 even over a 20-year period. Naturally 
Scores are somewhat less stable in adolescents, Persons teste 
before taking up their careers, who are not told the results, 40 
tend spontaneously to enter vocations consonant with these 
results; and those who later forsake a vocation are found tO 
score lower than those who stay in it. The test may also be 


useful for predicting occupational success, as in Kelly an 
Fiske’s research (cf. p. 26). 


OTHER METHODS OF ASSESSING INTERESTS 


Many vocational psychologists make use of a blank In 
Strong’s, or else a much briefer list, and by going over lete 
testee’s responses in an interview build up a fairly comp 


d 

‘Burnham, P. S., and Crawford, A. B., “ The Vocational Interests fag, 
Personality Test Scores of a Pair of Dice’. J, Educ. Psychol., 1935 
508-512. 

2 Cf. Berdie, Bibliography. ted 

3 C£. Duffy, E., and Crissy, W. J. E., ‘ Evaluative Attitudes as Rea 
to Vocational Interests and Academic Achievement’. J. Abn. 
Psychol., 1940, 35, 226-245. 
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Picture of his interests. This, of course, does not constitute a 
test, and subjective interpretation plays a large part. It might 
be thought that by classifying items under a few major types, 
and simply adding the numbers of responses under each type, 
a useful set of objective scores, or an interest profile, could be 
obtained. Thorndike! has studied adult interests by such a 
test, covering thirty-five topics or fields ; and recently Guilford, 
Shneidmann, and Zimmermann ê have published their GSZ 
Interest test, which includes twenty items for measuring each 
of eighteen main types of vocational and leisure interests. But 
this method fails to work well because people are found to vary 
So enormously in their standards of marking Like or Dislike. 
Some interpret Like far more broadly than others and check 
three-quarters or more of the items in this way, whereas others 
are more selective and check less than a quarter. To some 


et this may reflect genuine breadth or narrowness Asi 
interests, but often it is merely an irrelevant ‘ response Se r 
i i s and the Kuder 


This is the reason why both the Study of Value 
Preference Record force the testee to indicate relative preferences. 
In so far as he scores high on one value or interest, he must 
obtain lower scores on some other interest or interests. Though 
no direct allowance is made for response set IN the Strong 

lank, the scoring keys probably compensate for most of it 


automatically.? | 
Strong 4 has investigated the items checked by men of various 
ages from 15 to 55 years, and on this basis developed a scoring 


ey for what he calls Interest Maturity. Similarly Furfey and 


„horndike, E. L., * ue of Reported Likes and Dislikes for 
J. Pou e ak RA as Indications of Personal Traits '- 
s l. ‘ 
i asz ee Burn 5 Ba ills, Cal. : Sheridan Supply Co., 
1948. Cf. Guilford, J. P., Shneidmann, E. S., and Zimmermann, MY. Tp 
The Guilford-Shneidmann-Zimmermann Interest Survey ™ J rs 
Psychol., 1949, 13, 302-306. Incidentally these authors show that ho! 
Fe give very poor indications of occupational interests. 
th: Tyler, L. E. (private communication) has pointed ou 
hat most subjects tick a vast majority of Li 
th erences between different interest scores 41° 
ley choose. Tests of the Strong type do not wo) 
eas adults, because they have not developed 
ms of Disli 
« Strong, Tic, Change of Interests with Age. Stanford, Cal. : Stanford 
University Press, 1931. 
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Weber ! constructed tests of emotional maturity or develop- 
mental age for children, which included lists of interests, games, 
books, ete., common among boys from 8 to 18 years, together 
with Pressey X-O items (ef. p. 175). It was claimed, for 
example, that delinquents tend to give responses typical of 
normal children younger than themselves, Probably, however, 
fashions in interests vary too much for such tests to have much 
permanent value. 5 

Wyman 2 devised an indirect test of interests for Terman $ 
studies of gifted children, which was based on free word 
association responses, Groups of children were taken who, 
according to teachers’ ratings, were keenly or weakly interested 
in ‘ intellectual, social, or activity ’ interests. Their responses 
to a list of stimulus words were tabulated, and differential 
scores developed for each response. Thus when any new chil 
takes the test, his responses can be scored for resemblance to 
those of the intellectual, social, and activity groups. The 
technique is exceedingly laborious, and the resulting interest 
measures were found to have low validity (probably because 
the three standardization groups of 130 each were too small). 
Thus the correlations between the scores of fresh groups aD 
teachers’ ratings were only -54, -35, and -20. Moreover, the 
correlations of the three interest scores with one another range 
from ‘68 to -80, suggesting that the original criteria for selecting 
the groups were far too much affected by halo. The sets 1 
children were not really distinct, as were Strong’s occupation® 
groups or Kent-Rosanoff’s mental patients and normals (cf. p- um 
Nevertheless Wyman’s technique deserves mention because ofi s 
objectivity. It would be very diffieult to guess the associations 
that score highly, or to fake one’s results. Other objective tes 
of interests have already been discussed in Chap. VI. 


MASCULINITY-FEMININITY TESTS 


Strong provides an additional scoring key based on e 
differential likes and dislikes of men and women. As alrea 


1 Furfey, P. H., ‘ Developmental Age’. Amer. J. Psychiat., 1028, $ 
140-157. Weber, C. O., Further Tests of the Wells Emotional Age SC 
J. Abn. Soc. Psychol., 1932, 27, 65-78. sts 
2? Wyman, J. B., ‘ Tests of Intellectual, Social, and Activity Inten BB 
Genetic Studies of Genius, Vol. I., by L. M. Terman. Stanford, 
Stanford University Press, 1925. 
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ed (p- 130) a similarly constructed key is available for 
ae a Multiphasic Inventory. Sex differences occur 
iR nal er psychological qualities and abilities, for example 
et ge a meanings of words drawn from different fields 
forty ety hus Slater’s 1 Selective Vocabulary Test contains 
Leonie ansi red better by males, and forty by females. 
The ner i from 18 years up. The most elaborate test is 
containing ast sagen tte (M-F) Test of Terman and Miles,? 


Word Association (all items have multiple-choice responses); 
Inkblot Associations; General Information ; Emo- 
tional and Ethical Attitudes; Interests ; Opinions ; 
Introvertive Responses. 


Consi r 
nsiderable overlapping of the sexes occurs on the test as 


a 
ne but mean scores are about +50 for men and —70 for 
(Extreme es seale from +200 (extreme masculine) to —200 
like the o eminine). Homosexuals do not necessarily score 
inversion pposite sex, and a tentative additional scoring key for 
more ma 1s provided. Athletes of both sexes tend to obtain 
and ae and artists more feminine, scores than average ; 
Thus A other plausible group differences are described. 
for its bh ough the test, like the Strong Blank, may be criticized 
psycholo ind empiricism and lack of any coherent underlying 
instrum gical theory, it does seem to be a useful research 
BR to nee The three tests—Strong, MMPI, and M-F—over- 
reasonable extent. Shepler ° finds an average inter- 


cor 4 
relation of -58 in single-sex groups. 


London: Harrap, 1944. 
„Interest Analysis Test. New 


1 
Slater, P., Selective Vocabulary Test. 
d Personality. New York : 


aT 
Yo L. M., and Miles, C. C., Attitude 
McGra sychological Corporation, 1938. Sex an 
a zul, 1936. 
J, Connie B. F., ‘A Comparison of Masculinity-Feminity Measures °. 
ult. Psychol., 1951, 15, 484-486. 


X 
Projection Techniques 


I Freudian theory, ‘ projection’ is the mechanism whereby 
he Ego defends itself from unwelcome or repressed wishes 
and ideas by attributing them to others, or by projecting them 
on to the external world. The aggressive paranoiac, fe? 
example, thinks that everyone is attacking or plotting against 
him. Few of the numerous tests which are commonly groupe? 
under this heading directly involve Freudian projection; 
rather they provide a vehicle through which the subject 
expresses his personality structure. (Cattell 1 points out how 
heterogeneous they are, and suggests that ‘dynamism ’ tests 
might be a better term.) Thus they are closely linked with the 
expression of personality through movement, which we con” 
sidered in Chap. IV. Indeed Bell’s useful and comprehensive 
summary classifies expressive movement and graphology he 
projective techniques. In this chapter, however, most of ra 
techniques reveal personality through its effects on a subjec 
perception of some stimulus, and on the mental association” 
imaginative or creative activities thereby aroused, rather 
through bodily characteristics or activities. Moreover, t o 
take account of what we called content, in addition to style, 
expression, . ho- 
Frank * has pointed out that they originate from psy© on 
analytic techniques of dream analysis and free association ei 
the one hand, and from Gestalt psychology, with its emp her: 
on the whole as more than the sum of its parts, on the ct the 
The dream is “the royal road to the Unconscious’; an en 
creative productions of writers and artists have often Ay 
claimed by psychoanalysts to reveal underlying re. - 
trends and conflicts. At the same time, no single trait, Mr 
anism, or complex manifests itself in isolation from the re$ 


f 
5 sc 
* Cattell, R. B., ‘Projection and the Design of Projective re 
Personality’. Char. & Person., 1944, 12, 177-194. nality ” 
? Frank, L. K., © Projective Methods for the Study of Perso 
J. Psychol., 1989, 8, 389-418, 
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Te personality structure ; equally no single feature of a person’s 
antasies, nor of his responses to a projection test, should be 
considered in isolation. Projection testing arose indeed partly 


as a revolt against the search for tests of particular traits, 


because it seemed to yield insights into the structure of the 
ns that these methods 


Personality as a whole. In effect this mea 
are not tests at all. They do not set out to measure specified 
variables, any more than the psychoanalyst, say, aims to 
Measure the strength of his patient’s Oedipus Complex. We 
shall see, also, that subjective interpretation by the ‘ tester’ 
enters to a great extent, and that attempts to render the 
methods more objective—with which this chapter is largely 
concerned—have proved disappointing. Another important 
difference from orthodox testing is that the stimulus situation 
and the subject’s responses are as unconstricted or ‘ unstruc- 
tured’ as possible. This allows maximum scope for individual 
differences, and it helps to reduce the self-consciousness and 
the critical attitudes which are so prominent in answering 
Personality questionnaires. Usually certain instructions are 
Standardized, or certain material is employed, with a view 
to stimulating the subject’s fantasies; and these do limit the 
Situation sufficiently to make possible comparisons between 
One subject and another (thus dreams, artistic productions, 
and completely free play are not, in fact, very suitable for 
Projection testing). But the subject’s spontaneous reactions 
are observed and recorded in full; he is not forced to 
choose between multiple responses made up by the tester, 
Nor does the tester attend only to a few specified categories 
behaviour. 
Rn the limitation just men 
techniques can be, and have been, em 
there has been a plethora of new ones in recent years, 
and our survey will be confined to those which are best 
established. Ir addition to Bell’s book, an article by Sargent 
may be recommended as giving an excellent description an 


classification.1 


tioned, an enormous variety 
employed. Indeed 


Bell and S Ibliography. Useful accounts of selected 
techniques er Te Rapes Ht Drawing, and Finger-painting, 
Mosaics and Bender-Gestalt, together with a discussion of the theo ai 
Te plications of projective testing, are contained in : Abt, L. E., and Bellak, 

+» Projective Psychology. New York : Knopf, 1950. 
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FREE WORD ASSOCIATION 


f Though relatively little used nowadays, this was not only the 
first projection test, but also one of the earliest methods for 
exploring mental differences—being developed by Galton in 
1879. Usually a list of 50 to 100 stimulus words is read out by 
the tester; to each word the subject responds with the firs 
word that comes to mind. He is told not to search about 28 
particularly apt associations. Many of the associations = 
superficial verbal habits—opposites, rhymes, genus-species, ete- 
(Jung’s ‘ objective ’ type); e.g. black-white, father-mother. 
But a few stimuli may touch on emotional complexes and leas 
to personal (Jung’s * subjective’ or ‘ egocentric ’) respo nisa 
Often these are accompanied by signs of embarrassmen 
blocking, or laughter, and by a slow reaction time (2 seconds 3 
more) or complete failure to respond. These so-called comp 1a 
indicators draw attention, as it were, to a sore spot in t 5 
personality, which repays fuller exploration. Some testen ce 
through the list a second time and ask the subject to repro n 
his original responses, Failures of reproduction are & 
considered significant. spilities 
The first systematic studies of the diagnostic possibi 
of the test among psychoneurotic patients were carried ou Br 
Jung.! He also experimented with the psychogalvanic he ug 
whose deflections provide another sign of tension (t vallel 
they do not, as Whately Smith 2 claimed, closely pr ue 
lengthened reaction times). More fruitful is the Luria techn age 
of recording voluntary and involuntary muscular accomp 
ments (cf. p. 54), N tains 
Jung’s list of 100 words is often used, since it conte it, 

stimuli likely to evoke many common complexes ; Rapahay J 
Gill, and Schafer provide an alternative. Kent and Re ok) 
list was selected for a different purpose and avoids words itable 
to ‘ call up personal experiences’, Cattell 4 gives a list sul 


: 1918- 
1 Jung, C. G., Studies in Word Association. London : Heinemann Paul, 

2 Smith, W. W., The Measurement of Emotion. London: Keg 
1922. nity" 


ER a 
® Kent, G. H., and Rosanoff, A. J., * A Study of Association in Ins 
Amer. J. Insanity, 1910-1911, 67, 37-96, 317-390. 


. rsity? 
“Cattell, R. B., A Guide to Mental Testing. London: Unive 
London Press, 1936. 
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for a i s ` 
is ae en. to children, and another by Boyd (unpublished) 
o N ere Scottish Child Guidance Clinics. ‘ Chain ° 
the subject s association tests are sometimes preferred, where 
ind co is instructed to say everything that comes to 
Meltzer 1 bey with a given stimulus word or words. 
Parents h example, studied children’s attitudes to their 
one them to ‘think aloud’, first about some 
mother? ze s like ‘table’ and ‘ ball’, then ‘father ’ and 
een s e recorded the first ten associations to the 
The clini BR 
the nn or qualitative applications of free association by 
simplest res a or psychiatrist lie outside our scope. he 
that fall und od of scoring responses is by counting the numbers 
proposed, b T various types. Many classifications have been 
derable dis Wells and Murphy ? show that there are con- 
by differe we when the same responses are classified 
Between T testers, and that there seems to be no correlation 
traits or F types to which a subject is prone and his personality 
is neurotic or psychotic syndrome. The same 


conclusi 

y R probably holds for the diagnostic scheme elaborated 

paport, et al. Other measures which have been widely 
dian reaction time, or its 


Investi % 

En ge include the average or me 

average n, the total number of complex indicators, and the 

Correlate Perchogelyan response. None of these seems to 

other per. consistently with tests or ratings of emotionality or 

deserve sonality traits. A few more suggestive findings 

Scores Er a Cantril $ showed that persons with high 

stimulus y the Study of Values test react more quickly to 

Gilliland’ words connected with their values (cf. also Moore and 
Arrow. an of ‘aggressive’ words, p. 90). Fisher and 

their subj ‘ound that depressed or elated moods, induced in 

jects by hypnotic suggestion, affected the content and 

1 

Se eg H., ‘Children’s Attitudes to Parents’. Amer. J. Ortho- 
ae eae ele 

1919, ee ber “Association Type ® 
Taecox, Mani 76. Murphy, G., ‘Types r 

1928, 2, 53 Fr and Normal Persons - 

42, en H., General and Specific Attitudes ’- Psychol. Monogr., 1932, 
4 r = 

Gise ben: V. E., and Marrow, A. J+ « Experimental Study of Moods a 

` 'erson., 1934, 2, 201-208. 


nd Personality » Psychol. Rev., 
of Word-Association in Dementia 
Amer. J. Psychiat., 
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the speed of response. Meltzer 1 was able to classify children’s 
chain associations fairly reliably under such headings as 
pleasant vs. unpleasant tone, attachment to parents, level of 
socialization, and then to study the kinds of homes in which 
healthy and unhealthy attitudes appeared. 

An entirely different approach to the quantification of word 
associations was put forward by Kent and Rosanoff ? in 1910-11. 
(This was the prototype of the empirical method of standardiza- 
tion, later adopted by Haggerty, Olson, and Wickman; by 
Bernreuter, Strong, and others in different contexts.) They 
tabulated all the responses of 1000 miscellaneous normal persons 
to their special 100-word list, and noted the frequency of each 
response. When a new subject takes the same test, the fre- 
quency values of all his responses are summed to give a measure 
of what is called his idiosyncrasy (a low score) or commonality 

- (a high score). Alternative simpler forms of scoring are Ka 
count the number of common responses (i.e. the modal or mos 
frequent responses of the standardization group), or the number 
of individual responses (those not listed in the tables owing t° 
the infrequency of their appearance). Such tables are, of cou 
useful only among people similar in language and background to 
the standardization group; they quickly get out of date, an 
would certainly be unsuitable in Britain. Woodrow and Lowel 
prepared a new set for children, incidentally using a wein 
instead of oral form of the test. O’Connor’s 4 tables were bate 
on the responses of 2000 U.S. industrial workers. Proba 3 
the best plan for anyone using the test now is to score individu 
responses by reference to his own group of subjects, that is, ao 
count for each subject the number of responses given by 
other subject. me 

The empirically scored word association test does have 5° o 
value as a measure of mental abnormality; for Kent and BER 
showed that the average member of their standardization gt 
gave 7% of individual responses, whereas 247 mental hosp? Z 
patients gave an average of 27%. The present writer ee, 
obtained fair correlations between individual od Bee n 5. 
other measures of emotionality in a group of normal stud® 


1 Op cit. 2 Op cit. ney 
* Woodrow, H., and Lowell, F., ‘Children’s Association Frea® 
Tables’. Psychol. Monogr., 1916, 22, No. 97. 1928. 


* O’Connor, J., Born That Way. Baltimore: Williams & Wilkins, 
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Many indivi 

RER responses, low communality, slow reaction 

ite ET er complex indicators all overlap to some extent, 
psychological significance is most obscure. Some 


write A > Be 2 
rs have identified idiosynerasy with “autistic thinking ’, 


othe: i z ; A 
rs with introversion or with emotionality, some with 
ck of intelligence, and 


i i 3 

eigene or originality, others with la 

EEU aus for the most part the relations with other tests and 

Bears no vo that the Kent-Rosanoff technique 

Eniste. e ays to have been abandoned, even by psychia- 

ES itional reasons for this are the lack of satisfactory 
g tables, and the length of time required for giving the 


eena scoring. 

an er, onplipauone of empirical techniques are Wyman’s 

ana Kr zation against interest groups (cf. p. 168), and Kelly 
ey’s 1 even less successful attempt to measure children’s 


char: = 

acter traits through word associations. 
PRESSEY X-O OR CROSS-OUT TESTS 

tests were devised, not as a 


t as an exploratory instru- 
ling the stimuli that 


Re association, Pressey’s ? 
ment i = any particular trait, bu 
evoke = e applied in group form, for reveal \ 
aaa ag responses. Form A, for adults, contains four 
first at each with twenty-five rows of five words. In the 
to him -test the subject crosses out all words that are unpleasant 
second and encircles the most unpleasant ın each TOW. In the 
subject each set of five is preceded by a word in capitals : the 
encirel crosses out the words associated with this stimulus and 
regar EA the most closely associated. In the third, things 
Buih as wrong are crossed out and encircled, and in the 

, things about which the subject has worried or felt 


ne 4 Hie 
rvous. Form B, for children, contains three similar sub-tests, 
like or interest, 


Hee for reactions of wrong; worry, an i t 
Briti ively. Collins ® adapted this and published it for 
ish use, and Bennett and Slater included it ın their Sutton 

ds in the Social ‘Sciences. 


1 
le el T. L., and Krey, A. C., Tests and Metho! 

Te ro ei Senibner, 103 
Abn raed S. L., ‘A Group Scale 
1919, c. Psychol., 1921, 16, 55-64. 
* Collins, M., ‘ Briti 

., ‘British N for the Pressey X- 

Psychol., 1927, 18, et ‘or 


e Emotions ’. J. 


for Investigating th 
ed by Stoelting, Chicago, 


Tests publish 
O Tests `. Brit. J. 
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3, 
Booklet (cf. p. 125). A later edition, known as Pres ci 
Interest-Attitude Test 1 includes a section on personal c 
acteristics that the testee admires. 

Apart from their possible clinical uses, X-O tests have besa 
scored in a number of ways. Pressey regarded the total w nl 
crossed out as measure of‘ affectivity ’ or ‘ richness in emotio om 
associations’, There is no confirmation of this, but seorsa in 
dislikes, worries, and blameworthy actions tend to be Br 
neurotics and delinquents.?2 In Form A, Sub-test 1, ‘i aha 
chose the five words in each line so that one should re cael 
disgust tendencies, one to fears, one to sex, one to suspl¢ 
and one unemotional or a joker; e.g.: 


drunk choke flirt unfair white 


to 
Similarly in the fourth sub-test, the words are supposed a 
represent special types of abnormality—paranoiac, neu: 
schizoid, melancholic, and hypochondriacal ; eg. : 


abe ole ae s 2 P erms 
Injustice noise self-consciousness discouragement gi 


of 
There is no evidence that scores based on the numbers 
responses under each of these headings are sp a has 
Empirical Scoring, analogous to Kent and Rosano Areno 
been applied to the encircled words to yield a aria the 
“idiosynerasy ’ or abnormality. Pressey’s own lists ed on 
words most commonly encircled in each line were ist 
small and unrepresentative groups, and the scores are g 
in reliability. Collins listed the commonest a Sa ae 
1500 11- to 14-year-old British children, and found so ormals. 
higher idiosynerasy scores among delinquents than Sreciable 
But, like the free association measure, it fails to give app tona 
and consistent correlations with other measures of C Interest- 
1 Pressey, S. L., and Pressey, L. C., ‘Development of nn lished bY 
Attitude Tests’, J, Appl. Psychol., 1933, 17, 1-16. Test pu 
Psychological Corporation, New York, 1983. d t and Non 
*Cf. Courthial, A., ‘Emotional Differences of Delinquen' 31, 20, NO 
Delinquent Girls of Normal Intelligence’, Arch. Psychol., of Personality 
183. Himmelweit, H. T., and Petrie, A., ‘ The Measurement o: tt, BE. 90 
in Children’, Brit. J. Educ. Psychol., 1951, 21, 9-29. an Normal 
Slater, P., ‘Some Tests for the Discrimination of Neurotic t 
Subjects’, Brit, J. Med. Psychol., 1945, 20, 271-282. Cross-O% 
„CR. Flugel, J. C., and Radclyife, E. J. D., ‘The Pressey “" 98, 8 


Test Compared with a Questionnaire’. Brit, J. Med. Psychol., 
112-181. 
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Teiter and has now dropped out of use. The Interest-Attitude 
est is more promising, since it consists of words that boys and 
girls tend to answer differently according to age, and was 
standardized on 4000 cases aged 11 years upwards. Thus it 
yields a measure of emotional maturity. Pressey quotes high 
correlations with ratings of this trait, but there seems to be no 
further confirmation. Durea ? has prepared a scoring key which 
ae success in differentiating delinquent from normal 
n. 


OTHER WORD ASSOCIATION TESTS 
association test for 


Word-Connection. Another controlled 

adults was devised by Maller and Malamud, and adapted for 
British use by Crown.® Crown’s list consists of fifty stimulus 
words each followed by two responses, one commonly given by 


rn : 
ormals, one by neurotics; e.8.: 


SINK wash 
drown 


Subjects are given this as a printed group test, and tick what 


seem to them the best associations. Their scores are the 
numbers of neurotic choices. Moderate correlations have been 


Obtained with neuroticism in several researches. But scores 
appear to be affected by education or socio-economic level, 
Since normal groups average anywhere from 6-8 (hospital staff) 
to 18-9 (unskilled labourers); neurotic groups average 147. 
he test is very quick and simple to give, and is perhaps one 
of the more promising derivatives of word association. 
Incomplete Sentences Tests. One of the difficulties of ordinary 
ee association is that the single-word responses provide o 
1 Cf. Bri i ' M. B., ‘A Psychological Study 0 
Juvenile D na Bridgen chods >? Genet. Psychol. Monogr: 
ing, E. G., ‘The Predictive Value of Certain 
Se of Emotional Stability as Applied to College Freshmen’. Arch. 
Sychol., 1928, 15, No. 96. ag jle Delinquents „ 
Ohta ee M. A., * Personality a NA of Juvenile 
ä Malle, J.B we Test. Teachers College, Columbia 
niversity, Bureau of Publications, 1934. Malamud, D. L, aus in a 


Maller Co ‘ati S ing Device’. 
ntroll Test as & creening V 4 t. 
1946, 2128743, sn Er The Word Connection List as a Diagnostic 


Test: Norms and Validation’. Brit. J. Psychol, 1952, 43, 103-112. 
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little material to analyse. The very form of the test encourages 
subjects to give non-diagnostic associations. Payne ! suggeste\ 
presenting short phrases in printed form, such as : 


Other people.. 


The subject is instructed to write a few words—the first that 
occur to him—after each. This has the great advantage of being 
applicable as a group test, and although subjects often do give 
superficial responses, and can fairly readily fake if they wish to, 
it usually yields a wealth of emotionally toned material. 
Numerous lists, ranging from 20 to 100 phrases, have been 
published, among the best known being Rohde and Hildreth 2 
Shor’s and Rotter’s.2 Himmelweit and Petrie? find similar 
tests suitable for use with children of 9 to 14 years. Ordinary 
single-word stimuli, but with written sentence responses, WET? 
also used in British Army officer selection during the war. t 
, The responses can, of course, be interpreted clinically, bu 

in addition several methods of reasonably objective scoring have 
been proposed. The simplest is based on the proportion © 

unpleasantly toned ideas to pleasant. Rotter classifies 
responses as unhealthy or showing conflict, neutral, and positive 
or healthy, and lists specimens of each so that others can follow 
his scheme. He claims correlations around -70 with emotion& 
maladjustment. Shor looks for recurrent themes, signs ° 
resistance or evasion, unusual or atypical associations, 4? f 
other variables; while Rohde tries to assess the strength s 
various needs. The more elaborate methods are certainly ns 
reliable and there is less evidence of their validity.4 It v I n 
remembered that this was the most successful of the project 


1 Payne, A. F., Sentence Completions. New York: New York Guidance? 
Clinic, 1928. tion 

amonde, A. R., and Hildreth, G., Rohde-Hildreth Sentence Completion 
Blank. New York: Psychological Corporation, 1940. Shor, J., Rep sp. 
on a Verbal Projective Technique’. J. Clin. Psychol., 1946, 2, BT tes 
Rotter, J. B., et. al., * Validation of the Rotter Incomplete Sen 
eos for College Students’, J, Consult. Psychol., 1949, 13, 3848-356. 

cit., p. 176. sective 

* Cf. Symonds, P. M., ‘ The Sentence Completion Test as a Pi roject! 

Technique’. J. Abn, Soc. Psychol., 1947, 42, 320-329. 
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tests used in Kelly and Fiske’s research on selecting clinical 


Psychologists (p. 26). 


STORY-TELLING TESTS 


res Controlled Projection Test. This should be mentioned 
thin y because, although it does not attempt to measure any- 
eee snd has received no experimental validation, it does 
6 Me e a method of exploration suitable for children from 
d De and for adults, In the individual form the subject 
st a free drawing and is simultaneously told an incomplete 

ory about someone similar to himself, and asked a number of 
ee what did he (or she) think about so-and-so, what 
ae e do next, ete.? The dual task is supposed to produce 
ae spontaneous responses and more identification with the 
ee described, though there is no evidence for this. In the 
Sak, form the subjects look at a picture containing a person 
ab whom they can identify themselves, and answer questions 
® out him in writing. No prineiples of interpretation are 
el, but sample drawings and response® for different ages 
vos reproduced. Foulds ? lists typical, and often very different, 
a Ponses for normal and delinquent boys ; and Kaldegg 3 has 
„pplieq the method in an extensive study of differences In 
amily and other attitudes among English and German children 


and students. 
Self-Description. Candidates for commissions in the British 
roy and for the higher Civil Service are asked to write 
© short descriptions of themselves, first as seen by a 
good friend, secondly by a candid critic. These are inter- 
ren by the psychologist or psychiatrist in conjunction 
ob material gathered during the interview and with other 
Projection test responses. No evidence regarding its value 1s 
available. 
fa, Several writers have made use 
en than Raven, presenting, 
Raven, J. C., Ci ESA 
. C., Controlled Projection Test. 
DE oulds, G. A, R ee Projection Test Responses of a Group of 
tive Delinquents’. Brit. J. Psychol., 1950, Tr 
B * Kaldegg, AS ee of German and English Secondary Saheal 
Oys to a Projection Test’. Brit. ‘J. Psychol., 1948, 39, 30-58. ‘A Study 
of German and English Teacher-Training Students by means of Projective 
echniques’. Ibid. 1951, 42, 56-118. 


ee forms of verbal 


of more fr D 
e, skeleton stories 


for exampl 
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or beginnings of stories and getting children or adults A 
elaborate or complete these.1 These are usually ena 
along the same lines as responses to the Thematic Apperceptio’ 
test (q.v.). 

Griffiths 2 describes the wealth of material obtainable om 
5-year-old children by free story-telling, drawings, accounts Bs 

cams, reactions to simple inkblots, and interviews. She N 
more concerned with emotional development and imaginatio 
in this age group than with Personality diagnosis. j ter 

Literary Productions. Tt ig a truism that the creative W 
painter, musician, scientist, or philosopher expresses Bi 
personality in the style and content of his productions. BU 
discussions of this topic are almost wholly subjective ea 
Speculative. Kretschmer’s pyknic (cyclothyme) and Beth 
(schizothyme) writers, etc., are claimed to show distinctiv? 
styles. William J ames’s tough and tender-minded philosophers 
represent a similar dichotomy. Eysenck and Gilmour,? howe 
classified 107 Philosophers as materialists or idealists on hi 
basis of a short attitude-type test, but found no relationship 


matching experiments (ef. p. 49 f.) have shown some consistency 
between literary style and other expressive characteristics. As 
one investigation, the present writer found it possible to m 
essays produced anonymously by 18 students with en 
of their personalities based on observation during an ee 
testing session, with moderate success. F. Allport, ee ys 
and Lathers 4 carried out a larger study of T E i 
written by a group of students, demonstrating that their med 
was generally consistent throughout. This style often see: Aid 
to throw light on the personalities of the writers, but they rar 

not try to verify this. Probably the uncontrolled lite 


1 E.g. Murray, H. A., “Techniques for a Systematic Investigat’ 
Fantasy’. J. Psychol., 1936, 8, 115-149, Sargent,-H., ‘ An Experi ity 


Test’. Psychol. Monogr., 1944, 57, No. 265. don : 
* Griffiths, R., 4 Study of Imagination in Early Childhood. Lon 

Kegan Paul, 1935, i pers * 
® Eysenck, H. J., and Gilmour, J. S. L., “The Psychology of PhilosoP 

A Factorial Study’. Char. & Person., 1944, 12, 290-298. ition and 
4 Allport, F. H., Walker, L., and Lathers, E., * Written Comper 

Characteristics of Personality ’, Arch. Psychol., 1934, 26, No. 178. 
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roduction i 

Bere is far too complex to provide a basis for any 
approach to personality diagnosis. 


T 
HEMATIC APPERCEPTION TEST (T.A.T.) 


N 
( Bin 
a a ie zng others have used picture interpretation as 
the test in A ying intellectual and emotional qualities, but 
Murray Th present form was first described by Morgan and 
graphs (10 f ey also collected the standard series of 30 photo- 
ANS nick 10 for women, 10 for both). The subject is 
the situati picture and asked to make up a story describing 
together se the events leading up to it, and the outcome, 
Promptin, ith the thoughts and feelings of the characters. 
analysis ET allowed. A verbatim record is kept for later 
the subject e session does not usually exceed an hour, though 
second int may not cover all the pictures in this time. 
the eins ato may be held, or it may be used for discussing 
tebrer and asking the subject himself to help in 
een used ; generally ten 


S 
ae cel other sets of pictures have b 
garded as sufficient. They should be sufficiently ambig- 


uous j 
variety ae to give free rein to fantasy, and should show a 
individual incidents. But each of them should portray an 
younger a the same sex and about the same age aS, or 
imself a ae the subject, so that he can readily identify 
and co nic project his own needs and sentiments, frustrations 
used, al icts, and resistances, on the ‘ hero}. The test is widely 
slides fo, O, ın group written for being projected by 
ffice a at few minutes each. s included in War 
err Civil Service selection, and by the Office of Strategic 
standard It is very desirable that a standard pro 
investi pictures be used, so that the 
igators can be compared. Moreover; 


t 2 . . 
mes or ‘norms’ of performance are established, interpre- 
y unusual elements (just 


tati 
as ag: more readily be based on an. 
Schaf, expressive movements, cf. p- 50 Rapaport, Gill and 
er publish such a list of expected responses for the Murray 
f T.A.T. is Van 


Set, z : e 
An interesting continental version o 

ı 

organ, C. D., and Murray, H. A» <A Method for Investigating 

1935 PR : the Thematic Apperception Test’. Arch. Neurol. & Psychiat., 

, 84, 289-306. Test published by Harvard University Press, 1943. 


13 
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Lennep’s ! 4-Picture test. Jackson’s 2 set of 6 pictures, A Test 
of Family Attitudes, appears suitable for use with children in 
this country (Fig. 6). í : 
Unfortunately the approach to interpretation varies widely 
with the theoretical background of the psychologist. Those 


Gy 


Fig. 6.—Reproduction, about two-thirds actual size of Picture II in 
Lydia Jackson’s Test. 


who follow Murray ® have adopted his elaborate ahh ae ot 
needs and presses (external forces that the individual regar Hk 
beneficial or harmful). But Rotter, Harrison, Rapaport, a i 
Wyatt present simpler schemes.4 These are based on the sty 


1 Van Lennep, D. J., Four Picture Test. The Hague : Nijhoff, 1 eral 

* Jackson, L., ‘ Emotional Attitudes Towards the Family of 35-5), 
Neurotic, and Delinquent Children’. Brit. J. Psychol., 1950, 41, 
178-185. Test published by Methuen, London, 1952. r Oxford 

® Murray, H. A., et. al., Explorations in Personality. New York : and 
University Press, 1988. Sanford, R, N., et. al., ‘ Physique, Personality ns, 
Scholarship ’. Monogr. Soc. Res. Child Develpm., 1943, 8, No. 34. kon, 
S. S., The Thematic Apperception Test. New York: Grune and S 
1947. ods 
‘Harrison, R., ‘The Thematic Apperception and Rorschach pres 15, 
of Personality Investigation in Clinical Practice’. J. Psychol., x ‘ons £0 
49-74. Rotter, J. B., ‘ Thematic Apperception Tests : Suggestio yatt, 
Administration and Interpretation ’. J. Person., 1946, 15, 70-92. Test’ 
F., ‘The Scoring and Analysis of the Thematic Apperception 
J. Psychol., 1947, 24, 319-880. Rapaport, et. al., Bibliography- 
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T str . a8 
5 er. with instructions, language character- 
ren lietieniess coherence, consistency of stories with one another, 
pre re etc., and on the content of recurrent themes— 
ana emotional tone, social and sexual or other attitudes, 
Malate ae (particularly of the hero), and unconscious 
iiterpreter ¢ a defences. It is essential for the scorer- 
Psychologist y pe trained and experienced; the amateur clinical 
responses In who uses any old set of pictures and interprets the 
Burt Ben by intuition should certainly be discouraged. 
and Sen! advocate a still more straightforward 


a . 
Pproach, owing little or nothing to abnormal psychology. 
s such as Level 


h 
of nie the strength of a number of qualities suc! 

dehiiees en or Coherence, Observation ol etails, Verbal 
themes M magination, Extraversion-Introvers on of the main 
Symonds Sy vs. Childishness, Integration VS. Neuroticism. 
represent on the other hand, claims that the stories mainly 
overt repressed drives which may be the very reverse of the 

= personality traits. 

hospital writers have testified to the value of T.A.T. in mental 
tests Er clinics, etc., especially when used along with other 
Conflicti ch as Rorschach. But scientific evidence 1S more 
i ing. Harrison and Rotter ° have been able to show that 


Inte: a 

Be aone of the same material by different testers are 

ably consistent, and have achieved a high degree of 
ith independent case 


Success j J 
„ccess in matching interpretations wi nt 
ist of the characteristics of 


spear 
ns Bell gives an extensive lis ; 5 
us a roduced by various types of psychotics and neurotics. 
ut it i should be genuinely useful for differential diagnosis. 
is much less successful when used for predicting suita- 
officer, clinical 


bili 
ity of personality for some occupation (Army 
ble to estimate the IQs 


ps. i 
Ychologist, ete.).4 Harrison © was & 


1 Unpubli 
Publish 
aS ed memorandum. 
Amer, Peyeholos M., ‘ Interpreting the Picture Story (TAT) Method X 
mer. Psychologist, 1947, 2, 288-289. | Re) 
Themas R., and Rotter, J. B., ‘À Note on the Reliability of the 
“Cf. T Appereeption Test’. J. Abn. Soc: Psychol., 1945, 40, 97-99. 
Printed elly and Fiske, Bibliography. Guilford, J. P-» 
o. 5 late Tests. Army Ait Forces Aviat. Prog: Res. Rep. 
® Harri ashington, D.C.: U.S. Government Printing Office, 947. 
Appereeption R., ‘Studies in the Use and Validity of the Thematic 
940 ee ee with Mentally Disordered Patients °. 
‚©, 122-138, 3 
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of 87 cases, with a validity of -78. Sen! finds the reliability 
of assessments by different scorers on Burt’s qualities: to 
average only -4; nevertheless such qualities as Observation, 
Verbal, Level of Organization and Maturity gave very prormmgh 
correlations with follow-up results among high-grade eivi 
servants. Some attempts have been made to objectify the 
test by providing multiple-choice questions or stories to choose 
from, but this seems to be a blind alley. 


OTHER PICTURE TESTS 


Shneidmann’s Make a Picture Story Test (MAPS).? ahe 
consists of 22 background pictures, and 67 figures, some © 
which are arranged on the background by the subject, while he 
tells a story about what they are doing. So far this has Ds 
used chiefly with schizophrenic patients, and a series of object- 
ively scorable characteristics (in addition to qualitative 
features) have been found to differentiate them from norn eos 

Rosenzweig’s Picture-Frustration (P-F) Study. Althoug 
this picture test also involves somewhat subjective scoring a 
interpretation, its limitation to a few fairly clear-cut personali y 
trends is an advantage. Itis based on Rosenzweig’s classifica 
tion of people’s reactions to frustrating situations: some ten 
to blame others or the environment (extrapunitive) ; some to 
fault with themselves (intropunitive); while some tend 
minimize or evade the frustration (impunitive). Adults = 
children’s tests are available, and they can be given in gr 2 = 
form. They consist of two dozen cartoons or line ide ‘ 
in which one person describes a situation or makes a rem tes 
which deprives, disappoints, accuses, harms, or inerimine, $ 
another. The subject is instructed to write the first words on 
come to mind as the response of the second (frustrated) pas A 
A number of categories of response are distinguished, $ as 
specimen answers are provided to help in scoring. There ! 

1 Op cit. 5 

a E. S., ‘Schizophrenia and the MAPS Test’. Genel 
Psychol. Monogr., 1948, 88, 145-223. 


z ication 
® Rosenzweig, S., ‘ The Picture-Association Method and its A Be 
in a Study of Reactions to Frustration’. Char. d& Person., 1945, 14, 
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yet little evidence to prove that the resulting scores do corres- 
pond to typical frustration reactions in everyday life. However, 
IR of the categories—need-persistence (measured by responses 
ae stress searching for a solution)—correlated positively with 

e persistence factor in MacArthur's research (cf. p. 14). 
k The Szondi Test.“ This is the most bizarre in our catalogue. 
h consists of 6 sets of 8 photographs. In each set of 8 is a 

omosexual, a murderer, an epileptic, an hysteric, a catatonic, 
4 paranoiac, a manic, and a depressive. The subject chooses 
di es in each set which he likes most and 2 which he 
dislikes, This should be repeated on several occasions. Accord- 
ing to Szondi, it is not those who show these personality 
characteristics overtly who like the corresponding pictures, but 


those in whom the tendencies are latent (owing to recessive 
d tendencies are 


ee): Repressed, rejected, oF sublimate r 

epresented by dislikes. Although the chief English-speaking 
Proponent of the test, Deri,? has put forward a less esoteric 
u of the theoretical basis and uses of the method, there 
ems to be no evidence of its value, apart from case-study 


material. 


others 

put together. On the other hand, a : 
ay for Rorschach testing to become & cult, like bee 
Ey sis, with the same tendency to aoma n 
n and to dissenting sects, and the same implicatio 
fre ly the initiated can ® nderstand it, and that it is immune 
= ordinary scientific standards of ee a 
use it is concerned with the ‘total’ pers ty- 
i ate. 

FH n 1000 books and articles have been published on it to d 
Szondi, L., Szondi Test. New York: Grune and Stratton, 


2 Deri : 
Steerer S., Introduction to the sandi Test New York 
ratton, 1949, Grune and Stratton, 
1942 07Schach, H., Psychodiagnostics. eier 1946. 
1942 ; London: Methuen. Blots (4t edit.), same P“ ges and Practice 
of a introduction for British readers js Mons, W-» Princip! 

Rorschach Personality Test. London : Faber, 1948. 


New York : 
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Since 1937 the Rorschach Research Exchange has been published, 
a journal exclusively concerned with the test, though this has 
now been expanded into the Journal of Projective Techniques. 
Many psychologists from Binet onwards have used meaning" 
less inkblots as material for investigating the imagination. 
Rorschach, a Swiss psychiatrist, tried out a large number © 
rather elaborate blots with mental hospital patients before 


Fig. 7.—Outline reproduetion of Rorschach’s Blot No. II. 
(Nine-sixteenths original size.) 


publishing the standard set, of 10, together with his monograph» 
Psychodiagnostik, in 1921. [Administration is very simple. Pee 
blots (some black-grey-white, some coloured (ef. Fig. 7) ch, 
presented in turn, and the subject is asked what he sees in e& as 
what it makes him think of. He is encouraged to pee he 
many associations as possible, and to turn the cards round I a 
wishes. The examiner records unobtrusively, usually 0? 


È N ies, the 
Specially prepared form.2 After completing the series; 


ts” 
1Cf. Tulchin, S. H., ‘ The pre-Rorschach Use of Inkblot Tes 
Rorschach Res. Exchange, 1940, 4, 1-7. Cf. also Hertz, Bibliography" ach 
* E.g. Klopfer, B., and Davidson, H. H., Record Blank for the Ror 1939 
Method of Personality Diagnosis. New York: Rorschach Institute, 


ne Projection Techniques 187 
dene aie dein in order to elueidate what determined 
Westie .the m ich parts of the blots they refer. In Klopfer’s 
probes deeper ae , there is a third stage where the tester 
spontaneous! an prong responses which were not produced 
responses, ce or a normal production of about thirty 
rel ehornous a usually takes less than half an hour, but there 
none sein ariations; some patients give less than ten 
may take e several hundreds. Scoring ‘and interpretation 
Each e several hours. 
response is scored under three headings : 


1. 
Mode of apperception—whether the whole plot (W 
1 (D) or unusual 


zespons e) an ordinary detai 
etail (Dd), or based on & white space in the blot 


IT (DS), etc. 
» The determinant—the form or shape (F+ or = 
according to aptness), colour (C), shading OT 
ing movemen! 


chiaroscuro (K), or a response implyin 
i colour and move- 


(M). The relative proportions © 
ment responses yield the Erlebnistypus or experience 
ve (excess of i 


balance—extratensi 
versive (excess movement), man 
UL c (dilated ambiequal), few oF none (co 
. Content—original (O+, ~» or common response 
al (A), anatomical, object, 


(P); human (H), anim 
B geographical, nature, architecture, plant, etc. 
eee aid Klopfer,! the leaders of the main American “sects °’, 
the main Bea a of more detailed scoring cate ories. Each of 
or profile ypes of response is summed to give the psychogram 
and their and interpretation is based mainly on these scores 
both | GR eae pr The profile epitomizes, 8S it were, 
Rn aele ape affective life ect. Thus 
rehome nee is shown by good form, origina an 
synthetic,’ but an excess of W shows > more abstract An 
intellect,” of D a more practical, and of Dd a pecan. 
e color Colour corresponds roughly to tionality, aD 
tionalit LE SLESPOUSES show little influence of form, this emo- 
y is poorly controlled. 
d’. Res. Monogr. 


1 Be 

Aimer. oe . J., Introduction to the Rorschach Metho 

The Ror. opsychiat. Ass., 1937, No. 1- Klopfer, B- and Kelley, D. M., 
schach Technique. Yonkers, N.Y.: World Book Company, 1942. 


Movemen 
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a rich inner life rather than overt emotion, somewhat like 
Jung’s introversion, and the various kinds of shading responses 
show sensitivity and anxiety. Some of the content categories 
have special significance; thus the commonest responses, 
animals, show lack of imagination, anatomical responses, 
morbidity,1 ete. Though particular responses are sometimes 
explored, e.g. along psychoanalytic lines, content plays & 
much smaller part in interpretation than it does in the clinical 
application of word association tests or in T.A.T. Indeed many 
psychologists regard Rorschach and T.A.T. as complementary, 
and prefer to apply both to their subjects. The sequence © 
Rorschach responses, and the context in which certain ones 
appear, may also help in the interpretation of neurotic trends 
and resistances. The significance of any score natura ly 
depends on the extent to which it deviates above or below the 
normal score for other similar people. Numerous tables © 
norms have been-published, but there are such wide variations 
with age and education of the subjects, probably also with the 
manner of giving the test, and with productivity or to 7 
number of responses, that these are far from satisfactory- 
The above account is much over-simplified, and referenc® 
should be made to Klopfer’s and/or Beck’s manuals. 2 
most essential for the Rorschach tester to receive thoroug i 
training, and fortunately this is fairly readily available * 
U.S.A. and in England. e 
Reliability and validity are difficult to investigate, bei 
Rorschachites insist that no category should be considere tal 
isolation.? (The dependence of every category on the to a 
number of responses, and the skewness of score distribution 
constitute additional snags.) ? Nevertheless, interpretations © 
frequently based on small differences in category scores ; 


example, 8 Colour-Form and 1 Form-Colour would be regarde 
py fact! 
nd 


1 Sandler and Ackner haye recently classified types of content 0565 Oh 


analysis. On comparing their categories with psychotic diagn 

symptoms, anatomical responses were found to be strongly associa’ drome: 

an insecure-aggressive, rather than an hypochondriacal, EYT xpe 

Sandler, J., and Ackner, B., ‘Rorschach Content Analysis: An 

mental Investigation’. Brit. J. Med. Psychol., 1951, 24, 180-201. sective 

T * Cf. Ainsworth, M. D., ‘Some Problems of Validation of Pro) 4 
echniques’. Brit. J. Med. Psychol., 1951, 24, 151-161. h score?” 

Cf. Cronbach, L. J., ‘ Statistical Methods Applied to Rorschac 
Psychol. Bull., 1949, 46, 393-429. 
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Bay ‘different «frome Colour-Form and 8 Form-Colour 
ane Thus it is desirable that such scores should possess 
ath ity and consistency. The ordinary split-half and retest 
to ods of studying reliability have been used, but are open 
th criticism. Perhaps the fairest method is to compare with 
e results of a parallel series of blots, two of which are available 
Oben by Behn-Eschenburg, one by Harrower. Meadows * 
bet ined correlations averaging only ‘50 for 25 score categories 
Eta too. Rorschach and Behn responses of 100 normal persons 
wi 00 neurotics. They ranged up to ‘84, put many coefficients 
ere very low. The reliability of scoring might also be queried, 
a Several experiments have shown that different experts do 
aes very closely. Tables have-been published to assist m the 
Ta ring of doubtful responses (good vs. Poor forms, norma y 
re details, etc.). Other researches indicate that it is scarce y 
ae ssible for a subject to fake a good or poor personality; unless 
Course, he has been primed. Temporary mood, however, = 
FR rapport between the subject and the particular tester do 
Cle to have some influence.” oki 
y A TA for the diagnostic significance of the categor: 
angel i ight appear, 
meee is kos tates ee in different neurotic and 
a Yehotie groups. A typical manic 
a different psychogram from a typ 
stone Tate correlations are often obtaine 
neur s and intelligence tests ; Hertz * quota h 
sho: oticism, though these have not been confirmed. 
e wn by factorizing the various category scores ana 
a reduced to three main components which might be ae 
a sa fluency or productivity, intelligence gna nee 
ency. Correlations of around were obtained bervor 


Fo 


Blots. New 


ach and Behn Tests. 


2 
Php adows, A. W., An Investigati an 
. Thesis, University of London, 195° er 
Br Lord, E., = Experimentally Induced en 
a panance ’, Psychol. Monogr., 1950, 64, BER: Historical Summary % 
eyehol. 2 M. R., ‘ The Rorschach Inkblot Test : 
Be Bull., 1985, 82, 33-66. Teig. 
n, A. "A Statistical Study of the Rorsenae 


Stati: 
alist. Sec., 1950, 3, 21-39. 


in Rorschach 


Brit. J. Psychol., 


190 Personality Tests and Assessments 


the factor scores of students and associates’ ratings on thee 
traits. This type of piecemeal validation is, of course, pct 
foreign to the manner in which the Rorschach is normal y 
used, though the fact that it is successful suggests N 
might be turned into a more objective personality test. Match- 
ing methods of validation have been tried, though they, oe 
have their disadvantages. Undoubtedly they show on 
diagnoses based on the psychogram as a whole (even w a 
interpretation is done ‘ blindly ’, without seeing the Sabi a 
themselves) can be identified with case-studies or clinical da 
with considerable success.! Thus the skilled Rorschach te 
can certainly help in the differential diagnosis of men 
patients. He seems, however, to be less able to make he 
predictions about normal persons. Thus the test was of bbs! 
little value in selecting clinical psychologists. Nor did any k 
the scores correlate with success among U.S. Army Air F TE 
pilots, but predictions made from the complete psychogra 
did yield an appreciable validity coefficient. 3 er? 
Several modifications of technique have been tried. N 
has worked out a method of group examination, where the E 
are shown by slides and subjects write their responses the 
prepared forms. This seems to yield similar material to 5 
usual method, but rather less complete. Munroe 3 develop B 
an ‘inspection technique ’ for rapid and relatively objec és 
scoring. The presence or absence of twenty-eight a ea 
emotional instability in group records was counted, and dents 
sign-scores gave rather promising validity in identifying ee ae 
whose academic work fell below the level expected from 5 an 
ligence tests and previous examinations. Similar is Bühl na 
Lefever’s 4 method of scoring for adequacy vs. malfunctio) vill 
+ Cf. Vernon, P. E., ‘ The Significance of the Rorschach Test valida 
J. Med. Psychol., 1935, 15, 199-217. Krugman, J. E., ‘A Clinical anges 
tion of the Rorschach with Problem Children’, Rorschach Res. Exc ostic 
1942, 6, 61-70. Benjamin, J. D., and Ebaugh, F. G., ‘The Diagn 178. 
Validity of the Rorschach Test °. Amer. J. Psychiat., 1938, 94, 116 SC 
* Harrower-Erickson, M. R., and Steiner, M. E., Large Scale Ror 
Techniques. Sprin field, Ill. : Thomas, 1945. io Per“ 
BN inte, R. te ‘Prediction of the Adjustment and Academic, Fs 2 
formance of College Students by a Modification of the Rorschach M 7 
Appl. Psychol. Monogr., 1945, No. 7. the Baste 
4 Bühler, C., Bühler, K., and Lefever, W. D., Development of rgchach 
Rorschach Score with Manual of Directions. Los Angeles, Cal. : Ro! 
Standardization Studies, No. 1, 1948. 
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sic been . Unfortunately Munroe’s technique failed to 
oat preciable correlations with college grades in another 

ent by Cronbach, with a rather different type of student 


Populati e 
en though there was moderate agreement with 
ised y staff ratings of social adjustment. Harrower also 
a eTA O a ; ine test. The. blots 
n on slides and nine possible responses to each are 


presented, some commonly given 
neurotics (these are listed Be Gyeench). The 
which ‚he thinks appropriate and his score is the 
neurotic choices. This seems to have very - validity, 
more reliable method suggested by Eysenck, 
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with neuroticism. 


ARTISTIC PRODUCTIONS 
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C Š 
ge A permanent recor’ 
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are cla: a but more commonly they create abstract ones whic 
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use of ns to success or the completeness of the 
hou, h ge or frame designs, winged an 
of de a considerable amount of data is ble type 
amon gns occurring in different psychopathologic® groups,’ an 
to pie children of different ages, the test can 
ee an objective method of diagnosis. 
i whom she has trained, can give remark: 
ach, L. J., ‘Studies of the Group Rorschach 
1950, 4l, a Coliere of the University of Chicago’. J- 
aaron, M., i » Amer. J. Orthopsychiat., 1949, 19, 
ee Det, ae MoR by Institute of Child Psychology, 
s read Psychological Corporation, New York. 
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the factor scores of students and associates’ ratings on the 
traits. This type of piecemeal validation is, of course, entiset 
foreign to the manner in which the Rorschach is norma y 
used, though the fact that it is successful suggests that if 
might be turned into a more objective personality test. ae 
ing methods of validation have been tried, though they, a t 
have their disadvantages. Undoubtedly they show be 
diagnoses based on the psychogram as a whole (even W. = 
interpretation is done ‘blindly ’, without seeing the subjeci 
themselves) can be identified with case-studies or clinical are 
with considerable success.! Thus the skilled Rorschach tes ‘al 
can certainly help in the differential diagnosis of we d 
patients. He seems, however, to be less able to make Ten 
predictions about normal persons. Thus the test was of ve 5 
little value in seleeting clinical psychologists. Nor did any i 
the scores correlate with success among U.S. Army Air a 
pilots, but predietions made from the complete psychogr& 
did yield an appreciable validity coefficient. 2 
Several modifications of technique have been tried. Harr ie 
has worked out a method of group examination, where the ee 
are shown by slides and subjects write their responses Ai 
prepared forms. This seems to yield similar material z 
usual method, but rather less complete. Munroe ® deve wi 
an ‘inspection technique’ for rapid and relatively obje 5 
scoring. The presence or absence of twenty-eight SE hese 
emotional instability in group records was counted, an dents 
sign-scores gave rather promising validity in identifying ae È 
whose academic work fell below the level expected Tom : an 
ligence tests and previous examinations. Similar is rape 
Lefever’s 4 method of scoring for adequacy vs. melma we 
* Cf. Vernon, P. E., ‘ The Significance of the Rorschach oe ‘valida- 
J. Med. Psychol., 1935, 15, 199-217. Krugman, J. E., ‘ A Clinica changer 
tion of the Rorschach with Problem Children’. Rorschach Res. Ex ostic 
1942, 6, 61-70. Benjamin, J. D., and Ebaugh, F. G., ‘ The Diagn 178: 
Validity of the Rorschach Test’. Amer. J. Psychiat., 1938, 94, 116 
* Harrower-Erickson, M. R., and Steiner, M. E., Large Scale Ro , 
Techniques. Springfield, IIl. : Thomas, 1945. ic Pet 
oe, R LS ‘Prediction of the Adjustment and Academic, pä n 
formance of College Students by a Modification of the Rorschach Q 
Appl. Psychol. Monogr., 1945, No. 7. the paste 
* Bühler, C., Bühler, K., and Lefever, W. D., Development of rachat” 


Rorschach Score with Manual of Directions. Los Angeles, Cal. : RO 


Standardization Studies, No. 1, 1948. 
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of personality. Unfortunately Munroe’s technique failed to 
yield appreciable correlations with college grades in another 
experiment by Cronbach,! with a rather different type of student 
population, though there was moderate agreement with 
dormitory staff ratings of social adjustment. Harrower also 
devised an entirely objective multiple-choice test. The blots 
are shown on slides and nine possible responses to each are 
presented, some commonly given by normal people, some by 
neurotics (these are listed by Eysenck). The subject ticks those 
which he thinks appropriate and his score is the number of 
neurotic choices. This seems to have very Poor validity, but a 
more reliable method suggested by Eysenck, where the subjects 
rank each set of nine responses, has given small correlations 
with neuroticism. 


ARTISTIC PRODUCTIONS 
Lowenfeld’s Mosaic Test? The subject is given & box 
containing some 2 to 400 small squares, triangles, and diamond- 


shaped pi i k, red, green, blue, and 
ped pieces, coloured white, blac H no 


yellow—together with a wooden tray line 1 h 
white paper. He is told to make anything he likes with the 
pieces, and is encouraged to go ON fied with his 
construction. A permanent recor 
ae the pieces before removing t 
olours. A bjeets make concre n 
patterns, ER Dok cial) they create abstract ones which 
are classified into various types * 
also according to success OF the completeness © the 
th, es or frame designs, winged 3 
ough a considerable amount of data is availa 
= designs occurring in different psychopathological groupa and 
among children of different ages, the test can hardly be sa 


to provide an objective method of diagnosis. u 
ake ined, can give remarkably penetrating 


those whom she has tral nee 

1 Cronbach, L. J., ‘Studies of the Group Rorschach ee A i 
Inge ss in the College of the University of Chicago’. T- 2 4 

4l N i 

2 Lowenfela M., ‘The Mosaic Test’. Amer: J: Orthopsychiab, 1040, n 
SOTRO50: Test material published by Institute of Chil sy! i 
London, and Psychological Corporation, New York. 

3 Summarized by Bell. 


192 Personality Tests and Assessments 


accounts of personality structure, but they are less sucosssell 
so far in laying down definite Principles of interpretation wbich 
others can follow. Kerr, and Himmelweit and Bean 
quote experiments in which experienced testers successfi ay 
matched Mosaics with personality sketches, or wrote skete ns 
from the Mosaics which were identified by psychiatrists. In 
tester was, however, unable to predict the answers of neuro a 
patients to a personality questionnaire. Notable A f 
of Mosaics over other projection tests are that it is quite i ly 
and that it can be given many times over. Thus it is particu. ae 
useful for following through personality changes during psy¢ 
logical treatment. : 2 have 

Drawings and Paintings. Psychological studies of art 
concentrated chiefly on: 


(a) Developmental stages—A useful application of ne 
work is Goodenough’s ‘ Draw-a-Man ° performa 
test of intelligence ; 

(b) Cultural influences ; hizo- 

(c) Artistic productions of psychotic patients, e.g. SC 
phrenics 3 ; es non 

(d) the therapeutic value of drawing and painting a ceur 
maladjusted children, and the changes that E 
during the course of treatment. Among adult e: 5 
also, particularly those who are treated by talova en: 
Jung, drawings are sometimes analysed in the 
manner as dreams ; ae 

(e) Tests, and factorial analysis of types, of artistic 3 

It is very widely assumed in child guidance that the style eal 
content of drawings provide valuable diagnostic hpa cance 
gives a lengthy table of aspects or elements and the signifi rious 
of each for personality which has been alleged by va ther 
authors. But there is an astonishing dearth of evidence rou 
than case-study material which, without a control & 


chides 
1 Kerr, M., ‘ The Validity of the Mosaic Test’, Amer. J. Or a Heperi 
1939, 9, 232-286. Himmelweit, H. T., and Eysenck, H, da p 
mental Analysis of the Mosaic Projection Test’. Brit. J. Med. 
1945, 20, 283-294, ure 
scf Goodenough and Harris, Bibliography. iterat! 
` Gf. Anastasi, A., and Foley, J. P., “A Survey of the Literatur, 


40. 
on Artistic Behavior in the Abnormal’, Psychol. Monogr., 10a 
0. 287. 
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such a ae _ For example, one cannot claim that such-and- 
one has also he characteristic of ‘ anxious > children, unless 
awings of n wn that this feature is relatively absent from 
which gave str ormal children. Even the matching method. 
a tee ten the validity of T.A-T., Rorschach, 
adult associati as been neglected. It is far too easy to read 
say that fenc ions into children’s productions—for example, to 
anxieties, et es represent repressions, dark colours or shading, 
Windows a Distorted human figures, houses with tiny 
the like often arise merely from backwardness or 


from 
defecti Š 
ive drawing skill. (Some authors recommen 
an paint-brush, 


nger-painti 
erayor as making fewer demands thi 
Proved that anol) However, Goodenough and others have 
re tend to be more incongruities in drawings of a 


Man b 
ym i 
aladjusted and delinquent children than in those of 
ow their Binet 


normals ; 
ental ae such children often score bel 
= awings of En her test. Buck uses as & test for adults 
ACcompanie: ouse, a tree, and a person -T-P Test), and 
interpretatio these by a series of questions. 
Subjective on, largely based on Freudian symbolism, i 
n conclus nd lacking in validation. 
ere ern we do not deny that drawings and paintings 
€xplorator Ive of personality, nor that they are valuable as an 
established. tool in clinical treatment. There are also well- 
Pathologie, wer: in the producti f various psycho- 
complex th groups. But the origins of any feature are SO 
8tapholo at interpretation is in muc the same state as WaS 
By 50 years ago. 


AESTHETIC APPRECIATION 


Man: 
y wri 
classify mad on art, such as Herbert Read, hav 
2 Waehn modes of appreciation, 4S well as type 
t e t y 
ae Detain ee a system of analysis of adult drawings 
y wit! ity sketches derived from this can be matched very ps 
E Interpretation 


o h s 
£ Spon ketches based on Rorschach. Waehner, T. S., 
> 3-70. gs’. Genet. ‘Psychol. Monogr» 1946, 


tay 
x neous Drawings and Paintin; 
G - Napoli 
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production, and have suggested relationships with Jung’s oF 
other systems of personality types. On the basis of intro- 
spective experiments, Bullough, Valentine, and Myers derived 
four main types of response to colours, musical tones, and other 
aesthetic stimuli: the objective or technical, the subjective or 
emotional, the associative, and the characterizing types. They 
did not claim, however, that individuals adhere consistently to 
one type, nor did they attribute these differences to personality 
traits. This is a problem which is amenable to experimenta: 
investigation, using Burt’s technique of factorizing correla- 
tions between persons. Individuals are asked to rank gee et 
of pictures or other artistic objects in order of appreciation 
(or rate them on an appropriate scale). When their orders 
are inter-correlated, a fairly high degree of overlapping 5 
usually found even between experts and untrained adults an 
children. The correlation between each individual’s order 
and the average or standard order may be taken as a measure 
of his artistic taste. Eysenck ! finds some consistency 1” 
taste even when measured by such different sets of materials 
as reproductions of portraits, landscapes, statues, furnitures 
abstract figures (polygons), pieces of poetry, and odours: 
But over and above this general factor, he finds that on? 
or more bipolar factors can be extracted, showing that som? 
people have a fairly consistent preference for one type of ae 
others for another. The most clear-cut of these dichotomie® 
is that between highly coloured, fairly simple, and impression 
istic artistic styles and more complex, subtle and tama A 
or classical styles. Eysenck provides some evidence t D 
preference for the former type is associated with extraversio”, 
also with radical vs. conservative attitudes and (inverse 
with age. «pronto 
Similar factors have been distinguished in the appreciation p- 
poetry and music, though there is less evidence of their that 
nection with personality traits. Burt ? claims, however, “ek 
persons belonging to his four main temperamental types 
1 Eysenck, H. J., ‘Some Factors in the Appreciation of Poetry, DE F 
their Relation to Temperamental Qualities’. Char. & Person., N "Ie 
160-167. ‘The General Factor in Aesthetic Judgements RI B ” 
Psychol., 1940, 31, 94-102. ©“ Type ”-Factors in Aesthetic Judgem®' 
Brit. J. Psychol., 1941, 81, 262-270. Char. & 


* Burt, C. L., ‘ The Factorial Analysis of Emotional Traits °- 
Person., 1989, T, 238-254, 285-299. 
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p- 16) tend to show characteristic tastes in all the arts, as 


follows : 


Unstable extraverts like romantic, emotional, and dramatic 
human interest, e.g. Titian 


productions, with strong 
and Rubens, flamboyant Gothic architecture, Wagner 
and Strauss, Byron and Shakespeare. 

Stable extraverts like more objective and realistic art, for 
the associations it arouses rather than the emotions, 
e.g. Raphael and Chardin, Handel and Verdi, Johnson 


and Macaulay. 

Unstable introverts like mystical and impressionistic art, 
e.g. El Greco and Blake, early Gothic, Debussy and 
Delius, Shelley and Yeats. 

Stable introverts like intellectual and formal productions, 
eg. Van Eyck, Vermeer, and Cézanne; Bach ; 
Wordsworth and Henry James. 

tcard reproductions, people judged 

did express liking for 73%, of the 

of the non-corresponding types of 

lausible, but there is no 


In one experiment with pos 
as belonging to these types 
corresponding and only 42% 
Pictures. The other attributions are p. 
published evidence for them so far. 


PLAY METHODS 
Older theories explained play as resulting from surplus 
he instincts, or aS recapitulating the 
evolution of the race. Nowa regarded asia 
medium of self-expression i ishes, anxieties and 
conflicts, and socially unac re worked out. 
Thus it is particularly valua of young 
children, for whom language is such a d 
municating emotions. Like artistic produc 
certainly does not provi 
One would expect that fantasy 
compensatory, and that quite di 
expressed, e.g. in doll pla: u 
the family, arte a athe children. Inevitably, there- 
fore, subjective interpretation A 
a clue to personality, and there are grave dangers in jumping 
to conclusions about childr 
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observations and a slight acquaintance with psychoanalytic 
theories. È 

A further difficulty in adapting play as a projection technique 
is that it does not normally leave a permanent record ; descrip- 
tions by an observer or play therapist introduce many possible 
sources of error. Researches under Sears! at Iowa show that 
valuable results from the standpoint of personality theory can 
be obtained by the application of exact observational techniques 
to various aspects of doll play. An alternative approach is to 
employ a standard set of toys with which the child constructs ® 
scene, or plays out a drama, and to photograph the resulting 
constructions. This is the basis of Lowenfeld’s 2 ‘ World ’ test, 
so called because the child portrays, as it were, his view of 
world with miniature people, animals, houses, trees, 
Bühler and Kelley have collected and published a set % 
materials for this test, which they regard as a test of emotion’ 
disturbance. The method has been found useful also W? f 
adults.? Moreno’s 4 psychodrama is a more elaborate method 9 
involving adults in play for diagnostic and therapeutic purp o 
Tt is far too dependent on the skill of the director-interp wet 
to be considered seriously as a diagnostic test. Bow 
aa coiinects up with the group observational methods 

ap. VI. i 

Although play provides such a rich field for personality 
investigations among children, and has a very extens i. 
literature, it seems unlikely to yield any convenient or pra aly 
cable tests. We should recall, however, that it does not ple 
reflect unconscious motives. As described in Chap. VI, varom 
measurements of social and character traits were obtained o, it 
time-sampling and tests based on play. Obviously, topi 
provides an indication of interests. It would take too 


etc. 


ip 
s 
1 Cf. Pintler, M. H., Phillips, R., and Sears, R. R., ‘ Sex Differene"g1, 
the Projective Doll Play of Pre-school Children”. J. Psychol., 194% 
78-80. a Tech” 
* Lowenfeld, M., ‘ The Nature and Use of the Lowenfeld wos 981+ 
nique in Work with Children and Adults’. J. Psychol., 1950, 80, S onional 
Bühler, C., and Kelley, G., The World Test. A Measurement of E 
Disturbance. New York: Psychological Corporation, 1941. jg, tb? 
* Cf. Bolgar, H., and Fischer, L. K., ‘ Personality ProjectioP a 
World Test’. Amer. J. Orthopsychiat., 1947, 17, 117-128. | of socit 
‘Moreno, J. L., Sociodrama; A Method for the Analysis % 
Conflicts. New York: Beacon House, 1944. 
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observe and record all a child’s play activities in order to survey 
his interests ; and the results might have poor predictive value ; 
(for example, early mechanical interests among boys are 
notoriously unstable). But the Pressey X-O and Strong Interest 
Blank try to cover the same ground at a more sophisticated 


level. 


SENSE OF HUMOUR TESTS 


The mechanism of projection certainly enters into our 
appreciation of the comic, and there have been some attempts to 
base personality tests on the types of jokes appreciated. 
Eysenck ! found that when people rank jokes or other humorous 
material in order of funniness, or rate their appreciation, there 
is remarkably little agreement between them. Some persons, 
however, consistently tend to rate the simpler sexual and 
aggressive jokes more highly, whereas others consistently prefer 
more subtle and clever humour. In statistical terms, there is 
no clear general factor in the correlations between raters, but 
there is a prominent bipolar. Some correlations were found 
between the latter factor and extraversion-introversion, as 
measured by a personality questionnaire in normal subjects, 
and there was a slight tendency for hysterical patients to rate 
the sexual type more highly than did dysthymics. 

A much more elaborate test has been published by Cattell and 
Luborsky,? where some 200 jokes have been classified by factor 
(cluster) analysis into 11 pairs of types ; for example : 
rdant and morose. 

c. vs. stable acceptance. 
light badinage. 


i Be N ; claimed to 
High positive or negative scores ON cach of these ee certain 


derive f ‘ 
rom repressed personat : 
Son elationg ba been fo i ality-questionnalré 
N For example, the first type 
o surgency-desurgency. The scheme is pro 


1 Eysenck, H. J., ‘Th iati 
» H. J., e Appreciatit 
Theoretical Study’. Brit. PP Psychol., 1942, 32, 205-309. in Response 
2 Cattell, R. B., and Luborsky, L. B-» + Personality ar a nor Te. 
to Humor”. J. Abn. Soc. Psychol, 1947; 42, 402421. T sting, 1950. 
Champaign, IIl. : Institute for Personality and Ability Testing, 
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for many of the scores have low reliability, and they overlap 
considerably. Indeed when they are factorized they appear a 
resolve largely into the same bipolar dimension that Eysene 
found. Moreover, the significance of the various kinds of jokes 
is dubious. When several judges were asked to classify then 
under Cattell’s 11 types, they showed little agreement. Nez 
theless, the test is both genuinely projective and objective, an 
clearly merits further development. 


XI 


c : 
‘onclusions and Future Developments 


ions of our survey, 
tions in whic 

ed—selection, 
The field of 


N tryin 
I CEE NS. the practical implicat 
Personalit able to distinguish three main situa 
experim y tests or assessment: 
uch, and diagnosis or 
affected bes the most straightforward because it is 
unsolved y the difficulties of personality theory 
18 no Loe oD emp discussed in Chap- ~ 3 
pr personalit reach agreement as to the main traits oF dimensions 
can be dete y. The value of any proposed test or ot 
criterion s ermined directly by comparison with some external 
At the s uch as educational or vocational success and failure. 
likely to be e time, progress along pur irical lines is 
argely on aos the choice of suitable methods depenes 
experiment, equate personality theory, and 0 
oe aa and diagnostic study of ersonality- And, as 
IScovery of Parry point out, follow-up 
application good external criteria are far from easy: 
Need for s of tests for selection is handicapped, to 
highly eee tests to be short and t depe 
e ea testers nor on elaborate aP and so forth. 
Pencil test iore naturally ask first how use e 
candidate: s, since they can generally be applied groups © 
Cost, Test and scored, by slightly trained tester a 
p: 163) the of interests and attitudes such as the Stron: 
alues ( e Kuder Preference Record (p. 1 3), and the Study of 
ough 162) have certainly prove heir worth In America, 
and it is similar is immediately available 10 
or for 11 oubtful how far they are suitable for a% 1 
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such tests is their susceptibility to faking, and they should at 
least be supplemented by objective information tests (p. 91). 
When a small number of interests is involved, as in selection for 
technical schooling, Peel and Lambert’s technique (p. 91) may 
be an improvement. Other aptitude tests to some extent reflect 
personality and interests; for example, Moss’s Social Intelligence 
Test (p. 92) might well be superior to ordinary intelligence tests 
in selecting for appointments involving social contacts. 

It is difficult to see any use whatever in selection for miscel- 
laneous personality inventories like the Bernreuter, Bell, 
Guilford, Boyd, etc., valuable though it would be to have 
measures of such traits as instability, extraversion, an 
ascendance. The reasons are fully set out in Chap. VIII. The 
only type worth considering would have to be at least as 
disguised as the Bennett-Slater (p. 125), and should preferably 
employ the forced-choice principle. That is, self-rating items 
should be tried out in actual selection situations; a smal 
number of valid items should be chosen on the basis of correla- 
tions with an external criterion, and combined with invalid 
ones In such a way as to defeat the faker or the malingerer. 

More factual biographical inventories, perhaps containing & 
few relatively innocuous self-deseription items, are worth 
developing but, as pointed out on P. 136, they necessitate large 
numbers and a rather stable selection set-up. In the case OF 
children we advocated (p. 121) the construction of a similar 
third-person questionnaire and rating scale for teachers to apply» 
whose items would be empirically scored after validation 
against, say, secondary school success. 

Few of the well-known projection tests are suitable. They 
are too lengthy, much too dependent on the subjective judgmen 
of the tester-interpreter, and evidence for their validity is poo™ 
Thus it seems unlikely that the group Thematic Apperception 
contributed anything worth while to the selection of Britis? 
Army officers or civil Servants. Sentence Completion (p. 177) 
is most free from these defects, but needs to be put across very 
carefully if candid responses are to be obtained. Picts 
Frustration (p. 184) might be tried, and group Rorschach score t 
by inspection technique would be worth validating agains 
external criteria. A useful, quick, and objective controlle 
association test might be developed from Malamud’s oe 
Crown’s Word Connection List (p. 177) and Pressey’s Interes 
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Attitude test (p. 176), given sufficiently large numbers for the 
preparation of up-to-date scoring keys. 

Physical and physiological measurements and tests of 
behaviour must be approached very cautiously. The electro- 
encephalograph does indicate certain types of abnormality, and 
it is possible that the psychogalvanic reflex or other measures 
of autonomic functioning might prove relevant to certain jobs. 
Probably, however, they are too variable, and involve too 
elaborate recording. Static ataxia (p. 80) is one test of proven 


validity, and Sheldon’s somatotypes (p. 87) may be significant. 


Even if the correlations of any such measure with the criterion 
are low, provided they are stable, it can contribute something 


to selection. 

Most of the objective tests fall into two classes, described in 
Chaps. V and VI respectively. The former are too simple and 
specific, for example, most of the f, p, and o tests. The latter, 
which approximate more or less closely to samples of everyday 
life behaviour, are too dependent on the subject’s attitudes and 
on the way they are put across, and, therefore, too readily 
distorted by his desire to display an acceptable personality. 
Nevertheless, there are some exceptions: body sway (p. 80), 
manual dexterity and bodily co-ordination (p. 80) are certainly 
connected with emotional stability; and the discrepancy 
between verbal and spatial-mechanical abilities (p. 74) has 
some significance. - More hopeful are indirect measures including 
handwriting pressure characteristics (p. 60), oral fluency and 
the verb/adjective quotient in speech (p. 56), muscular 
reactions in the Luria apparatus (p. 54) and, particularly, 
measures of tension and breakdown under stress in complex 
reaction, co-ordination and learning tests (p- 85). The Q-score 
in Porteus Mazes (p. 63) and ‘carefulness ° at group tests 
(p. 66) fall under the same heading. Persistence can be 
measured in complex performance tests and in resistance-to- 
discomfort, but an adequate battery of persistence tests tends 
to involve too elaborate arrangements and too much time to be 


practicable. $ 

It is a moot point whether a psychological tester s assessments 
of such traits as stability, impulsiveness, persistence, etc. based 
on observation of behaviour at apparatus tests (p. 66), would 
not be more reliable and valid than objective measures of such 
behaviour. Both might be employed. The various group 
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observation methods developed by War Office Selection Boards 
(p. 96) are highly acceptable to employers and candidates but, 
being far more complex, they involve more subjective interpre- 
tation on the part of the observer and allow more scope for the 
candidate to modify his normal behaviour. Thus they depend 
greatly on the skill of the particular observer, and are generally 
less valid than might be expected. By contrast, time-sampling 
(p. 94) or other techniques of recording specified categories of 
behaviour are highly objective, but are too time-consuming and 
elaborate to be employed in selection. A compromise might be 
worked out, where the situations would be more structured 
than in most group exercises used at present, and the recording 
scheme more standardized. 

Another interesting contrast may be drawn between grouP 
exercises and ordinary ratings by associates. The former make 
use of rather artificial situations, which are usually too brief to 
provide representative samplings of the candidates’ norma 
behaviour (though superior in this respect to the conventional 
interview situation), The latter are based on normal behaviour: 
usually observed over a considerable period, but in a casua 
rather than systematic fashion. The prime defect of ratings 1$ 
that they represent crystallized emotional attitudes of the raters 
towards the ratees, even when the rating scales attempt to 
emphasize observed behaviour. Observers of group exercises, 
or psychologists who judge qualitative behaviour at performance 
tests, are better able to maintain impartiality in comparing 
one candidate with another. Thus there is much to be said for 
arranging—where possible—trial periods of a week, a month 
or more, on the job (or in the case of children, at the school) for 
which they are being selected. Their performance during oD 
period should be assessed, not by generalized ratings, but A 
careful recording of specified types of behaviour; and thog 
should be made by a trained observer rather than by an oe 
ployer, head teacher, or supervisor. The objection will doubtles 
be raised that ‘ specified types of behaviour’ are too narrow) 
and that an overall picture of the candidate’s personal a 
social adjustment to the job is needed. But a great dea of 
evidence indicates that a few really reliable measurements a 
people provide better predictions of educational and ocen ir 
tional success than do subjective generalizations about ý 
personalities as wholes, 
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Although we have seen that ratings seldom contribute any- 
thing more than do school marks in educational selection, they 
can nevertheless be of value in occupational selection. It is 
most desirable that each rater should be acquainted with thirty 
or more candidates, in order that relative—not absolute— 
judgments can be used. Possibly, however, a carefully 
constructed forced-choice scale would overcome the distortions 
due to variations of standards among raters of small numbers. 
If neither of these approaches is practicable, the best alternative 
is for the rater to fill in a highly concrete third-person question- 
naire, accompanying this with a qualitative personality sketch 
or testimonial, and for the selecting psychologist to interpret 
this material so as to reach a final rating which will be com- 
parable from one candidate to another. The Vineland Social 
Maturity scale (p. 110) fits in well with this conception, and 
there is a need for other scales along the same lines. Several 
other points regarding the technique of rating are made in 
Chap. VII, for example, the desirability of getting judgments 
from raters with varied outlooks, and of making rather more 
use of ratings Or nominations by peers, less of ratings by 


superiors (teachers or previous employers, etc.). 
i of the selection interview should be 


Finally, the functions N ) 
severely restricted, for the reasons given m Chap. I, though it 
‘ith because of its acceptability to 


can seldom be dispensed w ceptal 
candidates and employers, and its greater practicability than 
It does have value in 


that of more objective techniques. 4 r 
marshalling the evidence about a candidate’s previous educa- 
and it can provide useful indica- 


tional or occupational career, 5 
uency, physical appear- 


tions of such traits as social aplomb, fl h 8 
ance, etc. Asa diagnostic technique for assessing personality 


in general, or for synthesizing data from all sources into a final 
judgment of suitability, it is—on the average—extremely unre- 
liable and invalid, although some interviewers (not necessarily 
psychologists or psychiatrists) are much better than others. 


RIMENTAL RESEARCH 


a method either praised or condemned in 
ld not be clarified by further investigation. 


ty field that the repetition, 


ersonali 
f previous researches would be 


EXPE 


There is scarcely 
this book which cou 
Results are so variable in the p 
extension, and interlocking © 
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useful than the eontinued construction of more or less 
Be tests. The concentrated effort which has gone into the 
improvement of social attitude scales, and made them into y 
genuinely useful instrument for sociological and opinion Se 
might well be applied to the measurement of miono 
variables, to time-sampling and systematic observation among, 
older children and adults, to interest classification and capes 
ment, to the selection and training of good interviewers, to I 
development of apparatus tests which will indirectly ah 
really significant personal behaviour, to the improvemen z 
ratings or measurements ‘ within persons ° and of ec 3 
for the quantitative treatment of patterns of scores, an 
host of other fruitful problems. 

Perhaps the Be need is for more studies like those g 
Hartshorne and May, Eysenck, MacArthur and others 
particular traits by the trait-composite, syndrome, and ete 
approaches, though unfortunately they have to be La cates 
elaborate. This is the only approach which provides aie 
factory criteria of the validity of different tests, rating or 
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main defects in the trait-composite approach—that it is t cross- 
sectional ’ rather than dynamic, and that it abstracts personality 
traits as though they were wholly properties of individuals. 
Direct recording of behaviour should help to reconcile trait- 
psychology with field theory and the study of social groups and 
processes on the one hand, and with ‘ longitudinal ° studies of 
personality development on the other hand. Although we have 
used the framework of trait psychology throughout this book, 
there is good reason to hope that some more fruitful system will 
eventually replace it. Obviously there are many other types of 
contemporary research—into experimental depth psychology, 
into the relations between perception and social and personality 
factors, ete.—which will help in constructing an adequate 
theory of personality, although at present they have little 
relevance to problems of personality assessment., 


DIAGNOSIS AND GUIDANCE 


The testing or assessment of personality for purposes of 
diagnosis, guidance, treatment, or control, is much the most 
intractable problem. For there is neither an external criterion 
of its value (as in selection), nor an internal criterion—the 
consistency of the results with one another (as in experimental 
research). The success of vocational guidance has indeed been 


followed up, but it is not 
particular element in 
diagnoses among me i F 
actually leads to such variable results that it is hardly possible 


to dissociate the test from the tester. i 
does well with interviewing oT with Rorschach, another with 


Thematic Apperception or drawings, another with deterioration 
tests or expressive movements, and so on. Hence in order to 
h of these or other methods, the diagnostician 
ided by the selector or the 
is much too apt to trust to 
the face-validity of his case d to his own experience ; 
these may be supremely good in individual instances and worth- 
less in others. The lack of any generally accepted framework 
for the description of person f further difficulties ; 
the graphologist, Rorschachite, Freudian, OT the average non- 

different 


medical vocational psychologist, 
14 


experim ! 
-studies an! 


SS 
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ages. That is why we have stressed making the maximum 
Es small eines of operationally defined composite 
variables, in spite of their apparent narrowness. 

The diagnostician is often less limited by time and cost than 
the selector, and may well spend a dozen hours on a single case 
if he applies Rorschach, T.A.T., Minnesota Multiphasic, ance 
biographical interview. Kelly and Fiske’s results throw t s 
gravest doubts on this intensive approach, and justify som 
resort to instruments which have received more objective 
validation, even though they seem less helpful in proita 
insights. Any of the techniques mentioned earlier in t r 
chapter under selection could play a useful part in gudar 
since any which have been successfully validated provide a 
clinician with checks on his intuitions. In addition, personality 
inventories are more worth while than in selection, because S 
the generally better motivation of candidates for guldanen 
Incidentally there is no need for questionnaires to be as lonh a 
MMPI or Strong’s Blank, even if these are at present the bes e 
their kind. Needless to say, their results must be interpre ul 
with caution, but at the same time the psychologist shaw 
remember that his main object in using them is to corr he 
biases in his own subjective summing-up of the personality Š 
is studying. Another step which would be salutary FA i 
vocational, psychiatric, or other clinic with a ‚large ine 
would be for different members to carry out the intervie ae 
and the favoured projection or other tests involving soba ove 
judgment, instead of expecting a single psychologist to pe is 
his diagnoses by applying several techniques. Finally, 25 who 
no evidence that psychiatrie interviewers or psychologis an 
make use of the concepts of abnormal psychology ent 
more consistent or capable of making more valid ook an 
or vocational predictions than those trained in on oac 
applied psychology. However valuable the medical app. little 
may be in therapy, the bulk of research shows that it has 
to contribute to selection or guidance. el 

Thus we conclude as we started, that the testing or ae 
of human personality is fraught with so many seg ee. ua! 
more complex indeed than any other problem in in p: ycho- 
psychology—that even the application of the highest p ected 
logical skill and technical accomplishment cannot be exp 
to bring about rapid success. 


ent 
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Scaling of marks and ratings, 105-6 
Schemata, 101-3 
Schizothyme, see Cyelothyme. 
School teachers’ judgments, 8, 24-6, 
. 105, 111-12, 114-16, 121, 203 
Screening tests for recruits, 125, 
142-3 


Selectivity, effects on correlations, 
26-7, 29 


Self-confidence, 11, 88-9, 128, 132, 
139 

Self-description test, 179 

Self-ratings, 122, 128 

Sensory defects, 38-9 

Sensory tests, 68, 75-6, 79 

Sentence Completion test, 27, 171, 
177-9, 200 

Sex differences, 10, 40, 58, 89, 119, 
133, 169 

Shipley-Hartford test, 72 

Situations, see Group observation 
methods. 

Sociability, 2, 5, 11, 15-16, 36-7, 41, 
47, 92, 94-5, 132 

Social attitudes, see Attitudes, 

Social factors, effects on Personality, 
4-5, 7, 84, 205 

Social intelligence, 92, 200 

Social maturity, 109-11 

Sociodrama, 196 

Sociometry, 29, 96, 114 

Somatotypes, 37-8, 201 

Sorting tests, 74-5 

Spearman-Brown prophecy formula, 
117 

Speech, see Voice. 

Speed, 68-70, 81 

Standards, variations of in rating, 
104-12, 117, 208 

Stanford-Binet test, 61, 73-4 

Statie ataxia, 80, 201 


220 

Statistical techniques in personali 
study, 19 ER 

Stereotypes, 17, 46, 48-9, 56, 102, 
152, 157 a 228 

Story-telling tests, 52, 179-80 

Stress tests, 66, 85-7, 103, 201 

Strong Vocational Interest Blank, 
28, 27, 111, 161-9, 174, 197, 199, 
206 en 

Suggestibility, 77, 80, 87-8 

Suggestion in personality question- 
naires, 138-9 

Surgeney, 18, 15-16, 56, 70-1, 
197 j 

Sutton Booklet, 125, 129, 175-6, 
200 

Sympathetic nervous system, 41 

Syndromes, 10-11, 17-18, 204 

Synthetic-analytic types, 80 

Szondi test, 185- 


T.A.T., see Thematic Apperception. 

Temperament, 2-3, 33, 85-41, 68, 
76, 81-2 

Terman-Merrill test, 73 

Testimonials, 20, 119-21, 203 

Tetanoid-Basedowoid types, 42 

Thematic Apperception test, 27-8, 
161, 180-4, 188, 193, 200, 205-6 

Thurstone’s attitude scales, 145-54, 
156-7 

Thurstone’s Personality Schedule, 
125, 129, 131, 141-2 


Time-sampling, 23, 94-6, 102-3, 
‘196, 202, 204 
Track Tracer, 79 
Trait-composite technique, 7-11, 


16-18, 22, 45, 63, 90-1, 94, 116, 
141, 157, 204-6 

Traits of personality, 4-16, 101-8, 
115-16, 144, 171, 199, 204-5 
Trend tests, 66 
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eS ‚ol personality and tempera- 
ment, 16-18, 35-40, 80, 194-5 


: Typė/Token ratio, 57 


Unconscious mechanisms, 3-5, 9, 
. 89, 51, 53, 59, 101, 139, 144, 
170-2, 183; 185, 188, 193, 195-6, 
206 


Validation of personality tests, 9-10, 
21-2, 31, 58, 98, 141-3, 199, 204-5 

Values, types of, 17, 162 

Variability, 68, 73-4, 78-81, 85-6, 
201 

Verb-Adjective ratio, 57, 201 

Vigotsky Blocks, 74 

Vineland Social Maturity Scale, 
71, 110, 208 

Visual-Motor Gestalt test, 61, 171 

Vocational selection and guidance, . 
1, 14, 16, 18-21, 23-6, 28-30, 38, 
48, 84, 91, 97-100, 119-21, 131, 
134-6, 163-6, 183, 190, 199-206 

Voice and personality, 9, 19, 49-50, 
52-8, 55-7, 63, 65, 69, 201 


War Office Selection Boards, 97, 
120, 202 
Watson-Glaser 
Thinking, 160 
Watson’s test of* Fairmindedness, 
158-60 
Wechsler-Bellevue test, 72-3 
Weigl Sorting test, 74 E 
Willoughby’s Emotional Maturity 
Scale, 109, 128 
Will-temperament tests, 68, 81 
Woodworth’s Personal Data Sheet, 
124-5, 129, 139, 141 j P 
Word association, 20, 23, 42-4, oira 
70, 90, 168-9, 172-5, 177-8, 
Word Connection test, 177, 20 
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